Universal 3.5 Pro Streaming adds Context Carryover (Full Transcript)

The new flagship speech-to-text model uses recent dialogue context to disambiguate short answers automatically, which is especially useful for voice agents.

[00:00:00] Speaker 1: Today we launched Universal 3.5 Pro Streaming, our latest flagship speech-to-text model. The feature that I'm most excited about is Context Carryover. Here's the problem that it solves. Most transcription models hear each moment on its own, with no memory of what came before it. So when someone gives a short answer like "C," there's nothing for the model to disambiguate it with, and it just guesses. Context Carryover fixes that automatically. The model remembers the last few things that were said, and uses them as context for whatever comes next. Let me use a practical example to show you what I mean. I can compare Universal 3.5 Pro with our older speech-to-text model here. And if I ask a question like, "Do you speak Spanish?" "Sí." "Did you arrive here by land, air, or sea?" "Sea." "Do you like option A, B, or C?" "C." So as you can see, the model takes each question as context for the answer, and transcribes it with a different form of "C" each time. And this is all done automatically by default on Universal 3.5 Pro Streaming. And so give it a try today, especially for voice agents, if you haven't already. I'm so excited to see what you build with Universal 3.5 Pro Streaming. Bye.
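
The idea described in the transcript (keep the last few utterances and use them to choose between sound-alike transcriptions) can be illustrated with a toy sketch. This is not how Universal 3.5 Pro Streaming is implemented and it does not call any real API; the `ContextCarryover` class, the `CANDIDATES` cue-word table, and the recency weighting are all assumptions made up for illustration.

```python
from collections import deque

# Hypothetical cue words for three spellings of the same sound.
# Purely illustrative; a real model does not use a lookup table like this.
CANDIDATES = {
    "sí": {"spanish", "español", "habla"},
    "sea": {"land", "air", "sea", "arrive", "travel"},
    "C": {"option", "a", "b", "choice"},
}


class ContextCarryover:
    """Toy rolling context that keeps only the last `max_turns` utterances."""

    def __init__(self, max_turns=3):
        # A bounded deque drops the oldest turn automatically.
        self.history = deque(maxlen=max_turns)

    def add_turn(self, text):
        self.history.append(text)

    def resolve(self, default="C"):
        """Return the spelling whose cue words best match recent turns.

        More recent turns count more: the weight doubles for each newer turn.
        With no matching context, fall back to `default` (a blind guess).
        """
        scores = {spelling: 0 for spelling in CANDIDATES}
        for i, turn in enumerate(self.history):
            words = {w.strip("?,.!").lower() for w in turn.split()}
            weight = 2 ** i  # index 0 is the oldest turn
            for spelling, cues in CANDIDATES.items():
                scores[spelling] += weight * len(cues & words)
        best = max(scores, key=scores.get)
        return best if scores[best] > 0 else default


# Usage: the same short answer resolves differently after each question.
ctx = ContextCarryover(max_turns=3)
for question in (
    "Do you speak Spanish?",
    "Did you arrive here by land, air, or sea?",
    "Do you like option A, B, or C?",
):
    ctx.add_turn(question)
    print(question, "->", ctx.resolve())
```

With an empty history the sketch can only return its default guess, which mirrors the behaviour the speaker attributes to models that hear each moment on its own.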

AI Insights
Summary
Universal 3.5 Pro Streaming, a flagship speech-to-text model, has launched with a key feature called Context Carryover. Traditional transcription models treat each utterance independently, which leads to errors when answers are short or ambiguous (e.g., "C"). Context Carryover retains recent conversational context and uses it to disambiguate subsequent short responses automatically. In a demo, the model correctly interprets "C" differently depending on the preceding question (Spanish "sí," travel mode "sea," multiple-choice option "C"). The feature is enabled by default and is positioned as especially useful for voice agents.
Title
Universal 3.5 Pro Streaming Launches with Context Carryover
Keywords
Universal 3.5 Pro Streaming
speech-to-text
transcription
context carryover
streaming ASR
disambiguation
voice agents
short answers
contextual memory
Universal 3.5 Pro
Key Takeaways
  • Universal 3.5 Pro Streaming is a new flagship speech-to-text model.
  • Context Carryover addresses ambiguity by remembering recent conversational context.
  • Short responses like "C" can be transcribed correctly based on preceding questions.
  • The feature works automatically by default in the streaming model.
  • The update is particularly relevant for building more reliable voice agents.
Sentiments
Positive: The speaker expresses excitement about the launch and highlights improved accuracy and automatic context handling, using enthusiastic language and a successful demo.