What You Can Create in the ElevenLabs Creative Platform (Full Transcript)

A tour of ElevenLabs tools for AI voice, music, SFX, images, video, studio editing, and dubbing—plus voice design, cloning, and localization.
Download Transcript (DOCX)
Speakers
add Add new speaker

[00:00:00] Speaker 1: Today, you're going to see everything you can do inside of the Eleven Labs creative platform. From high fidelity audio to stunning image and video, you get direct access to the world's most powerful AI models and tools in a single interface. Inside of your Eleven Labs workspace, you can find all the creative platform tools in the left toolbar. Let's begin with text-to-speech. Here, you can type out any text you want, click generate, and turn it into speech.

[00:00:23] Speaker 2: This is incredible. I mean, I've had thoughts, millions of them, swirling around in here, like a little mental tornado of brilliant observations and witty comebacks.

[00:00:31] Speaker 1: You can change the voice that speaks your text by clicking the voice drop-down menu and selecting any voice you like. There are over 10,000 voices you can choose from. If you select the Eleven V3 model, you can use what we call audio tags, which guide the delivery of the voice from tone, speed, intonation, emotion, and even make the AI laugh.

[00:00:47] Speaker 3: You're so funny. That made me really laugh.

[00:00:51] Speaker 1: You can add audio tags by typing any word you want within square brackets. And again, click generate.

[00:00:56] Speaker 3: Did you see what I just did? Like, I cannot believe what I just did.

[00:01:02] Speaker 1: And with the Eleven V3 model, you can currently generate voice in over 70 languages. To find the perfect voice, we can head to the voice library, and here we can browse over 10,000 voices. You can use the search bar at the top, look through the trending voices, or browse the voice collections for different use cases and languages. You can also start creating custom collections to easily find the voices you like later. And with Eleven Labs, you can even create custom voices and clone your own too. But before I show you that, let's take a look at Image & Video. Image & Video allows you to create any visual asset you need with AI. In the prompt modal at the bottom, you can toggle between generating AI images or videos, choose from among the best AI models in the world, adjust the setting you like, choose the resolution, and the number of assets that we want to generate. Then we simply type out what we want to create and click generate. Now, as you can see, we have a character for our short film. We could then bring this to life by switching to video mode, choosing a model like V03.1, dragging and dropping our favorite image in as the start frame, typing out the action that we want to happen, and then clicking generate. And if we wanted to create a unique voice for this character, we can do that with Eleven Labs voice design. If you go back to the voice library, you can click on create or clone a voice, then you select voice design. Here, you can create a brand new voice simply by describing it. When you click generate, you get three voices to choose from. Pick your favorite, name it, and now it's saved to your voice library. From the same menu as voice design, you can clone your voice. There are two options, instant voice clone, which allows you to clone your voice with as little as 10 seconds of audio, and then professional voice clone. Which allows you to create a higher fidelity voice clone by uploading 30 minutes of your voice. Eleven Labs then trains a custom model based on your voice recordings, available for you to use with all of the other voice tools inside of the creative platform. So you can create voiceovers without having to record them. If you did want to use your own voice, you can record and upload it to VoiceChanger. This allows you to swap your voice for any voice in the voice library while maintaining the exact delivery of your original recording. So you could switch it up and sound a little something like this. And just in case your recording is a little too noisy, you can run it through the voice isolator, which removes any noise distraction. And I mean any.

[00:03:30] Speaker 4: As you can see, I wasn't joking.

[00:03:40] Speaker 1: Going further than just the spoken audio, you can also generate high quality music with commercial rights for your films, television, podcasts, social media posts, advertisements, and gaming. Simply describe the music you need, choose your preferred length, number of generations, and choose if you want Eleven Music to add lyrics, paste your own, or simply stick with instrumentals. Then you click generate and Eleven Music creates an entirely unique track based on your prompt. With Eleven Music, you can then further edit your track by adjusting the length in your timeline, editing the lyrics, the style tags, or simply by prompting it to make the changes you want. Once you have the result you're looking for, you can download the full track and even download individual stems. Now as well as music, we can also generate individual sound effects with the sound effects generator. If we click on sound effects in the left toolbar, we're taken to the sound effects generator. Just like with music, we simply describe what we want to hear and Eleven Labs will generate it for us. Literally anything. And just like with the voice library and Eleven Music, there's a full library of generated sounds and collections that you can browse. Now, once you've generated all of the assets you need for your project, you can assemble them all together with Eleven Labs Studio. Studio is the glue that ties every single one of the tools that we've just been through together. You can directly import all of the assets you've generated and generate new ones. The timeline layout allows you to compile your assets and layer your audio so you have full creative control over the assets you've created. Over the output of your content. And once you're done, you can render your video. Now, Eleven Labs allows you to take your finished content even further with its AI dubbing capabilities. This allows you to break language. And you can even use productions, which is a done-for-you, human-in-the-loop dubbing service powered by AI. So that's a walkthrough of the Eleven Labs creative platform. You're all in one workspace for creating high quality voice, music, sound effects, image, and video with AI. And on top of the creative platform, Eleven Labs also offers the agents platform where voice powers real interactive AI experiences. Head to elevenlabs.io or click the first link in the description and begin creating with AI.

ai AI Insights
Arow Summary
A walkthrough of the ElevenLabs Creative Platform showing how to generate and assemble AI voice, music, sound effects, images, and video. It demonstrates text-to-speech with voice selection and audio tags (tone/emotion/laughter), multilingual generation, browsing and saving voices in the voice library, creating voices via Voice Design, cloning voices (instant with ~10 seconds or professional with ~30 minutes), VoiceChanger to swap voices while keeping original delivery, voice isolation for noise removal, Eleven Music for commercially usable tracks with lyric/stem editing, a sound effects generator with a searchable library, Studio for timeline-based assembly and rendering, and AI dubbing/production services to localize content across languages. It closes by noting an Agents platform for interactive voice AI experiences.
Arow Title
Overview of ElevenLabs Creative Platform capabilities
Arow Keywords
ElevenLabs Remove
creative platform Remove
text-to-speech Remove
voice library Remove
audio tags Remove
Eleven V3 Remove
multilingual TTS Remove
voice design Remove
voice cloning Remove
instant voice clone Remove
professional voice clone Remove
VoiceChanger Remove
voice isolator Remove
AI image generation Remove
AI video generation Remove
Eleven Music Remove
sound effects generator Remove
Studio timeline Remove
rendering Remove
AI dubbing Remove
productions service Remove
agents platform Remove
Arow Key Takeaways
  • Text-to-speech supports extensive voice choice (10,000+), audio tags for expressive delivery, and 70+ languages with the Eleven V3 model.
  • The platform includes AI image and video generation with selectable models and settings, enabling rapid asset creation from prompts.
  • Users can create new voices via Voice Design or clone voices instantly (~10s audio) or professionally (~30 min) for higher fidelity.
  • VoiceChanger swaps a recording into another voice while preserving timing and delivery; Voice Isolator removes background noise.
  • Eleven Music generates original, commercially usable tracks with options for lyrics/instrumental, timeline editing, and stem downloads.
  • A sound effects generator creates and organizes SFX via prompts with a browsable library and collections.
  • ElevenLabs Studio provides a timeline to assemble all generated assets and render final videos.
  • AI dubbing and human-in-the-loop production services help localize content; an Agents platform enables interactive voice AI experiences.
Arow Sentiments
Positive: Enthusiastic, promotional tone highlighting breadth of features, ease of use, high quality outputs, and strong capabilities (10,000+ voices, 70+ languages, commercial music rights).
Arow Enter your query
{{ secondsToHumanTime(time) }}
Back
Forward
{{ Math.round(speed * 100) / 100 }}x
{{ secondsToHumanTime(duration) }}
close
New speaker
Add speaker
close
Edit speaker
Save changes
close
Share Transcript