ElevenLabs Dubbing v2 preserves emotion across 90+ languages
ElevenLabs has launched Dubbing v2, a new AI dubbing model that aims to preserve the original speaker's emotion, tone, pacing, and delivery across over 90 languages by conditioning directly on the original performance. The model is designed to automate the dubbing process for creators, marketers, and studios, offering solutions from one-click localization to integrated professional workflows.
Key Takeaways
- Dubbing v2 conditions directly on the original performance instead of relying mainly on transcripts.
- ElevenLabs says the model preserves tone, pacing, delivery, and emotional intent across 90+ languages.
- The system includes sync-aware translation that aligns starts, stops, and pacing automatically.
- ElevenCreative supports one-click localization for YouTube videos and other creator content.
- ElevenProductions combines Dubbing v2 with human translators, voice casting, and audio mixing.
Why It Matters
Dubbing v2 targets one of AI dubbing’s hardest problems: keeping translated speech close to the original speaker’s emotion and delivery. That matters immediately for creators, marketers, studios, and broadcasters that need multilingual video without rebuilding production around separate translation, voice, and mixing steps. ElevenLabs is packaging the model into two workflows — ElevenCreative for one-click creator localization and ElevenProductions for managed services — which ties the technology directly to content operations. The clearest signal to watch is API access, which the company says is coming soon.
Read full article at elevenlabs.io
.webp)