Xenova's Multilingual Real-Time Demo Amazes Users
Experience real-time multilingual translation with Xenova's in-browser demo using WhisperBase and Hugging Face, even on low-spec devices. A game-changer for presentations.
File
Realtime in-browser AI speech-to-text with OpenAI Whisper and HuggingFace Transformers.js
Added on 01/29/2025
Speakers
add Add new speaker

Speaker 1: So, you might have seen this absolute wild demo by Xenova from the Hugging Face team, which actually does real-time in-browser offline speech transcription is absolutely wild. It's great. It's using the WhisperBase model. Now, I just played around with this and wanted to have some fun. So, I have it here, it's WhisperBase. You're basically pulling down the model the first time you're loading this from Hugging Face. But now, the thing I want is to be able to translate this in real-time as well as we're getting this in. Now, luckily enough, Xenova actually has a demo as well that does multilingual translation again with Transformers.js. So, I was like, okay, let's just take SuperBase real-time. So, we can do SuperBase real-time broadcast, and basically just broadcast this to our app here where we can then translate it. So, now, this actually becomes fun is we're switching to a different language. Yeah. Now, so this is running locally on my pretty old MacBook Air M1. So, it's not that great, but we can also try this. Yeah. Okay. So, the problem is, the Latin languages do work a lot better. This is still really, really impressive stuff considering I have such a low spec MacBook. So, yeah, really a lot of cool things that you can do with Hugging Face, Transformers.js. Like SuperBase real-time, just kind of a cool little demo. Do let me know if that would be helpful for you. My idea is that when you're doing a presentation, you can just share this URL with the attendees in the audience, and then if they, for example, speak French, then they can say, I want to know what this speaker is saying in French, and then they can just select their preferred language. There we go, and they see what is being spoken about. So, yeah, pretty wild stuff all happening in the browser. Incredible things. So, do follow Xen over here, working on some incredible demos. You can see this obviously, 250,000 views. So, yeah, incredible stuff. Thanks for tuning in, and let me know what you're playing around with. Cheers.

ai AI Insights
Summary

Generate a brief summary highlighting the main points of the transcript.

Generate
Title

Generate a concise and relevant title for the transcript based on the main themes and content discussed.

Generate
Keywords

Identify and highlight the key words or phrases most relevant to the content of the transcript.

Generate
Enter your query
Sentiments

Analyze the emotional tone of the transcript to determine whether the sentiment is positive, negative, or neutral.

Generate
Quizzes

Create interactive quizzes based on the content of the transcript to test comprehension or engage users.

Generate
{{ secondsToHumanTime(time) }}
Back
Forward
{{ Math.round(speed * 100) / 100 }}x
{{ secondsToHumanTime(duration) }}
close
New speaker
Add speaker
close
Edit speaker
Save changes
close
Share Transcript