Explore Whisper AI: Simplify Transcription Effortlessly
Discover Whisper AI's speed and accuracy for transcribing audio, podcasts, and songs. Perfect for content creators, students, and professionals.
HOW TO Use WHISPER AI TRANSCRIPTION To Transcribe ANYTHING
Added on 01/29/2025

Speaker 1: Today we are going over Whisper AI. This isn't new, but I've been itching to cover it. Why? Because I truly believe it's underrated and often overlooked. Now, this tool can transcribe practically anything. I put it to the test with AI voices, podcasts, and even full songs with lightning fast lyrics. And guess what? Whisper AI didn't just keep up, it excelled. It's seriously amazing, and in this video, I'll walk you through how to install it with just one click. That's right, just one click. Before diving in, let's chat about what we're going to do. We'll install it together and then run some tests. We're talking transcribing different languages, testing the translation to English feature, and running a few transcribes on various types of songs. And for those who might hit a snag trying to run it locally, don't worry, I've got your back. I'll show you how to use this gem online. And let me tell you, Whisper AI is fast. When I say fast, I mean, it transcribed and spat out the lyrics for a song in just 18 seconds. Now, you might be thinking, so what? What's the big deal with converting speech to text? Well, let me tell you, the applications are vast and diverse. Think about content creators, like podcasters and YouTubers. They can use Whisper AI to generate accurate subtitles, making their content accessible to a wider audience. Or picture students and researchers transcribing lectures and interviews, making note-taking a breeze and studying more efficient. And it's not just for the education field. Business professionals can transcribe meetings and conferences, journalists can swiftly turn interviews into text, and even legal pros can document proceedings accurately. But here's a cool one, language learners. Imagine using this tool to transcribe audio in a new language. It's like having a personal tutor helping you with comprehension and pronunciation. And let's not forget about accessibility. 
Whisper AI is a game-changer for those with hearing impairments, providing real-time text of spoken content. The list goes on. From healthcare professionals documenting sessions to authors transcribing their spoken ideas, Whisper AI is reshaping how we handle speech and text. So let's jump right in and install this powerful tool. First you'll need Pinokio. So go to pinokio.computer and install and launch Pinokio. Next, click on the Discover button and look for Whisper WebUI. Click download and install. The install process took me around 10 minutes. It's a one-click installer, so you don't need to worry about command prompts or anything like that. Sit back and let it install. Once it's installed, click on launch. Now the fun part, let's go over some settings. By default we start off in File. This is where you can drop your files that you have on your computer. If it's already downloaded, you'll drop them here. Over here is the models. They have a whole variety of models to choose from. By default it's set to large V2. That's the default model. For language, default is set to automatic detection. It supports a whole variety of languages. And for file format, SRT is a subtitle format. Then we've got WebVTT and TXT. I normally use the TXT file. We've got the option to translate to English. Add a timestamp to the end of the file name. Here we have advanced parameters. Not gonna go over too much of the advanced parameters. You'll just generate your file by clicking on this button after you've uploaded your file. So they have a section here called YouTube. And you can simply paste your YouTube link and it'll transcribe the video. The same here. The models. We have large V2 selected by default. Then we have automatic detection on our language and our file format, TXT. Translate to English. Add timestamp to the end of the file name. And of course advanced parameters. The next tab we have the microphone. This is where you can record directly from your microphone. 
If you want to transcribe your speech to text, same here. The model selection, language, file formats. Translate to English, advanced parameters, and of course your generate button. So here we have T2T translation. T2T translation is where you upload your subtitle files to translate them. We have our source language and our target language. So this is where you would translate languages. If you wanted to translate some subtitles, so let's say the movie had Spanish subtitles, you could upload the SRT file and translate it to English. So with that all being said, let's take this for a test drive. The first thing I'll do is give it a simple test. This is my voiceover file that I will transcribe. It's actually me voicing over the rundown of Whisper AI that you saw earlier. All I did was narrate through my mic while recording my screen. I then used Whisper AI to transcribe the video to text, then I ran it through the AI voice that you are used to hearing, this voice. So here is a preview of the file I'll transcribe.
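
Editor's sketch: the three output formats mentioned in the walkthrough (SRT, WebVTT, TXT) differ mainly in how timestamps are written. The Python below is a minimal illustration of that difference, not Whisper WebUI's own code. It assumes segments shaped like the open-source `whisper` package's `result["segments"]` (dicts with `start`, `end`, and `text`); the function names are hypothetical.

```python
def format_timestamp(seconds: float, decimal_marker: str = ",") -> str:
    """Format seconds as HH:MM:SS,mmm (SRT) or HH:MM:SS.mmm (WebVTT)."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{decimal_marker}{millis:03d}"


def segments_to_srt(segments) -> str:
    """SRT: numbered cues, comma before the milliseconds."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)


def segments_to_vtt(segments) -> str:
    """WebVTT: a WEBVTT header line, dot before the milliseconds."""
    lines = ["WEBVTT", ""]
    for seg in segments:
        start = format_timestamp(seg["start"], ".")
        end = format_timestamp(seg["end"], ".")
        lines.append(f"{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(lines)


def segments_to_txt(segments) -> str:
    """TXT: the spoken text only, one segment per line."""
    return "\n".join(seg["text"].strip() for seg in segments)
```

With the open-source package installed, the segments would typically come from something like `whisper.load_model("large-v2").transcribe(path)["segments"]`; that step is left out here because it needs the model weights.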

Speaker 2: By default we start off in file. This is where you can drop your files that you have on your computer. If it's already downloaded, you'll drop them here. Over here is the models. They have a whole variety of models to choose from. By default it's set to large V2, that's the default model. For language, default is set to automatic detection.

Speaker 1: So that is the file we will transcribe. Let's run it through Whisper. Done in 12 seconds. So this is what it would look like in SRT format, as a subtitle file. If we wanted just the text, we could change the file format here by going here and selecting TXT. So let's do that. Now we have just the text file. Again, it took 12 seconds to transcribe that. So for the next test, I am going to give it something more challenging. Something with background sound effects and music. I'll use one of my old short-form videos. This was my last short that I uploaded, so let's try this. It's perfect because it's just a voice, then there are some whoosh sound effects, then music starts with the narration. Let's see how long Whisper takes to transcribe this. First let's preview the video. I took this image and converted it into a video with this new AI tool. The problem is, the video only generates four-second videos, so this is what I did. Go to this website, then click here. Next click here. Drag and drop. OK, let's run this through Whisper. Let's use the YouTube tab on Whisper and I'll grab my link here and paste it here. Click generate. Done in 4 seconds, and it did a perfect job with the transcription even with the sound effects and music in the background. Now let's try something a bit more challenging. Let's see if Whisper AI can transcribe a song. So in order for me to not get a copyright strike on my videos, I'll be using something from NEFFEX.
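
Editor's sketch: switching from SRT to TXT in the interface amounts to dropping the cue numbers and timing lines. For an SRT file that has already been saved, roughly the same result can be had with a few lines of Python. This is an illustration only; `srt_to_text` is a hypothetical helper, and it assumes a conventionally formatted file (cue number, timing line, text, blank line).

```python
import re

# Matches a cue timing line such as "00:00:02,500 --> 00:00:05,000".
TIMING_LINE = re.compile(
    r"^\d{2}:\d{2}:\d{2}[,.]\d{3} --> \d{2}:\d{2}:\d{2}[,.]\d{3}"
)


def srt_to_text(srt: str) -> str:
    """Return only the spoken text from SRT content, one line per cue line."""
    texts = []
    # Cues are separated by blank lines; handle each cue on its own so that
    # a spoken line made only of digits is not mistaken for a cue number.
    for block in re.split(r"\n\s*\n", srt.strip()):
        lines = [ln.strip() for ln in block.splitlines() if ln.strip()]
        if lines and lines[0].isdigit():
            lines = lines[1:]  # drop the cue number
        if lines and TIMING_LINE.match(lines[0]):
            lines = lines[1:]  # drop the timing line
        texts.extend(lines)
    return "\n".join(texts)
```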

Speaker 3: So let's go to YouTube and find a NEFFEX no-copyright song.

Speaker 1: OK, let's test this out. Let me grab the YouTube link and copy it and head back over to Whisper.

Speaker 3: Paste in our YouTube link, now click generate.

Speaker 1: That took 57 seconds to transcribe. Simply amazing. Now I am going to try it with a fast rap to see if Whisper can transcribe it. Of course I can't do anything mainstream or I'll get a copyright strike here on YouTube, so I'll just use another NEFFEX song since it is copyright free. For this song I'll use the song R.I.P.

Speaker 3: So let's fire it up and see how this goes.

Speaker 1: Nice, done in 24 seconds. Awesome. Now what can you do if you can't run this locally? Well, we can head over to OpenAI.com. However, it won't be free, but it's really cheap, I'm talking a few cents per transcription. So let's head over to OpenAI. When you're done you'll want to go to the API section. From here we can access the Playground. This is where we will access the OpenAI Whisper model to transcribe files online. When you're at the Playground, click on Assistants and select Complete from the drop-down menu. Now you will see a green microphone icon here. Click on that green mic icon and you'll be able to drag and drop your files here to transcribe. And it's as simple as that. Need subtitles for your videos? Done. Want to document a brainstorming session without missing a beat? Whisper AI has got you covered. But what's truly amazing is seeing this AI work its magic in real time. It's not just a tool, it's like a reliable partner that understands and transcribes every word with precision. And the best part? It's free and easy to use. You saw how simple the installation process is and how quickly you can start transcribing anything and everything. So why not give it a try? See for yourself how Whisper AI can make your life easier, your work more efficient, and your content more accessible. Don't forget to hit like, share, and subscribe for more AI tech tips and tricks. Until next time, keep exploring and keep innovating. AI Controversy signing out.
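
Editor's sketch: the video uses the Playground, but the hosted model can also be called from code. The Python below is a minimal sketch of that alternative, not what is shown on screen. It assumes the official `openai` Python package (v1 or later), the `whisper-1` model name, and an API key in the `OPENAI_API_KEY` environment variable; `transcribe_with_api` and the file name in the comment are hypothetical.

```python
def transcribe_with_api(client, audio_path, translate_to_english=False):
    """Send one audio file to the hosted Whisper model and return plain text.

    `client` is expected to be an `openai.OpenAI` instance (an assumption;
    anything exposing the same `audio.transcriptions.create` and
    `audio.translations.create` methods will work).
    """
    # The translations endpoint plays the role of the "Translate to English"
    # checkbox; the transcriptions endpoint keeps the spoken language.
    endpoint = (
        client.audio.translations
        if translate_to_english
        else client.audio.transcriptions
    )
    with open(audio_path, "rb") as audio_file:
        return endpoint.create(
            model="whisper-1",
            file=audio_file,
            response_format="text",  # plain text, like the TXT option
        )


# Example use (requires network access and a paid API key):
#   from openai import OpenAI
#   client = OpenAI()  # reads OPENAI_API_KEY from the environment
#   print(transcribe_with_api(client, "voiceover.mp3"))
```

Passing the client in as an argument keeps the helper free of network setup, so it can be tried against a stand-in object before spending any API credit.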
