20,000+ Professional Language Experts Ready to Help. Expertise in a variety of Niches.
Unmatched expertise at affordable rates tailored for your needs. Our services empower you to boost your productivity.
GoTranscript is the chosen service for top media organizations, universities, and Fortune 50 companies.
Speed Up Research, 10% Discount
Ensure Compliance, Secure Confidentiality
Court-Ready Transcriptions
HIPAA-Compliant Accuracy
Boost your revenue
Streamline Your Team’s Communication
We're with you from start to finish, whether you're a first-time user or a long-time client.
Give Support a Call
+1 (831) 222-8398
Get a reply & call within 24 hours
Let's chat about how to work together
Direct line to our Head of Sales for bulk/API inquiries
Question about your orders with GoTranscript?
Ask any general questions about GoTranscript
Interested in working at GoTranscript?
Speaker 1: Hello Internet. Deepgram is back with another update. See last time, we showcased our brand new AI speech recognition model, Nova. Link to that video is in the description. However, this time we're unveiling something a little different. It's a feature that many of us techies need, but is hard to find. That of course is speaker labeling, otherwise known as diarization. See an AI can transcribe spoken words, sure, but training an AI to recognize different voices in a multi-person conversation is an entirely different story. However, Deepgram has your back. The latest version of our speaker labeling feature is here, and it's better than ever. So instead of your transcripts looking like this, they can now look like this. Isn't that much better? And the best part? We've made it as simple as possible for you to implement this. Of course, you can check out our SDKs and our user-friendly docs to learn all you need to know. But here at Deepgram, we like to go the extra mile, so we've written a Python notebook for you. Link in the description. Using it is actually pretty easy. Just copy the notebook, upload any audio you want, and then run the cells. All the instructions that you need are already written inside the notebook. Follow those instructions and bam, you should have an output that looks like this. A speaker labeled transcript appears for you in seconds. But alright, if you're interested in what the code really says, let's break it down. This first cell simply installs dependencies. After all, you can't transcribe audio with Deepgram's models without first installing Deepgram itself. This second cell contains no code, but it does remind you to upload the audio of your choice into the notebook. Here I'm using a podcast excerpt. And this next cell is where the magic happens. There's only three variables you need to fill out first though. Number one, plug in your Deepgram API key here. To get one, just sign up for Deepgram. With every sign up, you'll receive up to 45,000 minutes of transcription for free. Alright, now that you have your API key, we'll have to specify, number two, the MIME type. In this case, my audio is in M4A format. But if you're using an MP3 or a WAV or any of these other formats on screen, you should be golden. Finally, I'm specifying the directory that my audios are in. Here, both my code and my audio files live in the current directory, so a simple dot should suffice. And now, I'm all set. By running this cell, Deepgram's API will now transcribe every audio in the directory I specified whose MIME type matches the MIME type I specified. The output is a beautiful, pretty printed JSON that looks like this. You can delve into the shape of this JSON if you wish, but if you simply want to extract a speaker-labeled transcript from this nested structure, all you have to do is run the final cell. Here, we've written a LeetCode-style function that will parse every JSON file in the directory that you specified in the previous cell. Then it'll output a TXT file containing your speaker-labeled transcript. And voila, you're done. Here's what my output looks like.
Speaker 2: You know, the first week I found it worked like 85% of the time, then it dropped down to about 66% of the time. So whatever it is, even if it's only 85%, that's still not enough for you to take your other cards out of your wallet. For me, I think I'm hovering around 5%. That's awful. You're not saving time at all. No. I really doubt that, because I can't imagine you trying something 20 times. What would inspire me to keep trying it? Well, anger. That's true. That's true.
Speaker 1: That's true. And here's the output that our friends at NASA got when they transcribed the audio of the first all-female spacewalk.
Speaker 3: My pleasure working with you this morning, and I'm working on getting that EV hatch open, and I can report it's opened and disposed. Thank you, Drew. Thank you so much. Okay. On your GCMs, take your power switches to back. Stagger switch throws and expect a warning tone.
Speaker 1: And now that you know how to diarize transcripts, go forth and get creative. For some inspiration, here's what we've seen done in the past. Call centers can use speaker labeling to distinguish between their employees and customers for every given phone call. Then topic detection or sentiment analysis can be applied to see what customers speak about the most and how they feel about those topics. Podcasters, meanwhile, have used speaker labeling to create closed captions for their content. And of course, we here at DeepGram Studios have created videos in which we use our diarization feature to create some fun analyses. For example, in this video, link in the description, we analyze just how much famous late night talk show hosts really let their guests speak. And that's only the beginning of what you can do with AI speech recognition. Truly, the sky and your own creativity are the limit. And if you have a fun demo that you'd like to showcase, hit us up on our socials, whether that's on Twitter, TikTok, LinkedIn, or even in the comment section below. Here are some satisfied DeepGram customers that we've featured in the past. Their code is quite impressive. Check it out. Link in the description, as usual. And as always, follow DeepGram for more AI content.
Generate a brief summary highlighting the main points of the transcript.
GenerateGenerate a concise and relevant title for the transcript based on the main themes and content discussed.
GenerateIdentify and highlight the key words or phrases most relevant to the content of the transcript.
GenerateAnalyze the emotional tone of the transcript to determine whether the sentiment is positive, negative, or neutral.
GenerateCreate interactive quizzes based on the content of the transcript to test comprehension or engage users.
GenerateWe’re Ready to Help
Call or Book a Meeting Now