Speaker 1: In today's video we are going to see how to transcribe, or convert, audio to text automatically using artificial intelligence, so to speak. Why would we need this? On this channel you have seen the videos about working at GoTranscript or any company that works in the same way. Or it may simply be for ourselves: if we have some audio that we would like to turn into text, and we do not want to transcribe it manually or pay a person to do it, it is very easy to do, and I am going to show you the way I do it, totally free. So, without further ado, let's see the tutorial.
And well, here we are in Office 365, and what we are going to use specifically is Word. I am going to do it in the web version, that is, from the browser; you could do it from the desktop application, and it works the same. For this I am simply going to open a new document here. We wait for the new document to open. As you can see, here it says Document 2, a generic title, which we can change. For this purpose I am going to put "test", for example. I click outside, it is updated, and that is now the name of the document. As we can see, it is obviously blank; I am going to make this a little bigger.
And here is the part that we are going to work with. In the first option, we have Dictate, where we could speak directly and it would write what we say. Or here, Transcribe, which is what we are looking for. When we click here, the first thing it gives us is the audio formats that it can transcribe: mp4, which would be a video, that is, different video formats; mp3; and m4a, which is the format generally recorded by the iPhone, that is, by Apple devices: an iPhone, an iPad, even a Mac. Then there is the language of the audio, whether we have it in English, in Spanish, or in any other language.
But even for Spanish, look, here it says Spanish (Spain). There are other Spanish variants here, as you can see: the Dominican Republic, which is where I am, Cuba, Costa Rica. Because, as we know, although we all speak Spanish, each country has some specific rules. I'm going to change it here, and I'm going to click where it says upload audio. We could also start a recording. The difference from Dictate is that with Dictate we can see the text being written in real time as we speak. Here we would make a complete recording and then process it. We are going to do it first by uploading the audio, and then we will do the recording part for those who want to see that working as well.
We're just going to upload it. Here I have a test file that I recorded a moment ago so I could use it. I select it, I open it, and, as you can see, the first thing it does is upload the audio, in this case to OneDrive, which is Microsoft's cloud, and then process it. So we wait for it to reach 100%. It's almost there: 95, 97. And there, it finished.
If you look, it gives me the time at which each section starts, and it also tells me the speaker, that is, Speaker 1. Since this is basically a monologue, with only me talking, there is only one speaker. If there were several people in the conversation, it would also identify them, with the time at which each person speaks.
We can handle that here when we click on Add to document. If you look, nothing has been added yet; this is like a preview. It gives us the option to add only the text, to add it with the speakers, with timestamps, or with both speakers and timestamps, which is how it is shown here, as complete as possible. That is, it gives us options. We may want to see how correct the transcription is, because it is not always going to be 100% accurate, since this is artificial intelligence. But it should be close.
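The four "Add to document" options described above (text only, with speakers, with timestamps, or with both) can be pictured as one rendering function with two switches. The following is only an illustrative sketch in Python, not how Word implements the feature: the `Segment` class, the `render` function, and the sample sentences are all hypothetical names and data chosen for this example.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One transcribed section (hypothetical structure for illustration)."""
    start_seconds: int
    speaker: str
    text: str


def format_timestamp(seconds: int) -> str:
    """Turn a number of seconds into HH:MM:SS."""
    hours, remainder = divmod(seconds, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def render(segments, speakers=False, timestamps=False) -> str:
    """Render segments with optional speaker labels and timestamps."""
    lines = []
    for segment in segments:
        prefix_parts = []
        if timestamps:
            prefix_parts.append(format_timestamp(segment.start_seconds))
        if speakers:
            prefix_parts.append(segment.speaker)
        prefix = " ".join(prefix_parts)
        lines.append(f"{prefix}: {segment.text}" if prefix else segment.text)
    return "\n".join(lines)


# Made-up sample data, not output taken from Word.
segments = [
    Segment(0, "Speaker 1", "Well, this is a test."),
    Segment(7, "Speaker 1", "Let's leave it there."),
]

print(render(segments))                                   # text only
print(render(segments, speakers=True))                    # with speakers
print(render(segments, timestamps=True))                  # with timestamps
print(render(segments, speakers=True, timestamps=True))   # with both
```

Keeping the segments as data and choosing the layout only at output time mirrors what the preview pane suggests: the same transcription can be added to the document in any of the four forms without transcribing again.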
We are going to play it and read along to find out whether it actually makes mistakes that we could then correct.
Well, this is a test so that we can see in a video how good or how bad the transcription is. For now, it should appear as a single speaker, maybe because I'm the only one talking and it's a continuous and constant dialogue. So I understand that it should not detect any changes of voice or speaker. The point is that we are going to see how the tool handles it, whether it gives us some pauses, some commas, and what else we might see with a person who speaks normally, as I am speaking. And blah, blah, blah, blah, blah. And that's it. Let's leave it there. Goodbye.
Well, if you noticed, it was not 100%. That is partly my fault, because there were parts that I said in English. For example, here is a "maybe". Over here... I lost it. Okay, this "maybe". There, at least, it wrote it correctly, as it is spelled in English. But here I said "so", and it put "solo", which was the closest thing it understood. Here, "trees" too. I think what I said was "I don't know", and it put "trees". Which is not bad; it's the closest thing it could understand. But if you look, I saved myself the time of writing all this manually, with all this information. If I had wanted only the text, look, here is how it would have put it, with the text only. Or if I had wanted it only with speakers, here it is again with the speakers. Blah, blah, blah. Those are the different options.
I'm going to delete all this. I'm going to remove this here. And if I had said it, Transcribe. But, wow. Okay, it's okay. New transcription. Accept. And I tell it to start recording. Allow: I permit access to the microphone. So there it is. It is already recording automatically. So let's wait at least a couple more seconds before stopping it, and see how it records. If you look, it is not writing anything on the screen. That will happen after we finish and it processes all the text.
So let's save and transcribe now. Well, if you look, there it is, uploading that audio. It was much faster. And if we look here, let's add it directly, the complete version. And let's play it.
So there it is. It is already recording automatically. So let's wait at least a couple more seconds before stopping it, and see how it records. If you look, it is not writing anything on the screen. That will happen after we finish and it processes all the text. So let's save and transcribe now.
If you look, it was a little better there. Because I only said a show. That is, it would be in English. And it obviously does not put it. But this is the way I do it, and you could use it too, simply using Word, which is a tool that the vast majority of people know. And that's it. I'm going to leave it here. If you have questions or would like to see something additional, leave it for me, as always, in the comments of the video. For my part, that would be all. See you in the next one. Goodbye.
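The manual check done in the video, playing the audio back and spotting words such as "so" that came out as "solo", could also be done mechanically when a known-good version of the text exists. Here is a minimal sketch in Python using the standard library's `difflib`; it is not part of Word or of the workflow shown, and the function name `word_differences` and the sample sentences are assumptions made for illustration.

```python
import difflib


def word_differences(expected: str, transcribed: str):
    """Return (expected_words, transcribed_words) pairs where the texts differ.

    The comparison is word by word and case-insensitive. Punctuation is not
    stripped, so "so," and "so" would count as different words.
    """
    expected_words = expected.lower().split()
    transcribed_words = transcribed.lower().split()
    matcher = difflib.SequenceMatcher(
        a=expected_words, b=transcribed_words, autojunk=False
    )
    differences = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            differences.append(
                (" ".join(expected_words[i1:i2]),
                 " ".join(transcribed_words[j1:j2]))
            )
    return differences


# Illustrative sentences built around the "so" / "solo" mix-up from the video.
spoken = "but here I said so"
recognized = "but here I said solo"
print(word_differences(spoken, recognized))
```

An empty list means the two texts match word for word; each returned pair shows what was expected next to what the transcription produced, which is a quick way to find the spots that need correcting by hand.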