AI & VideoProduct LaunchMay 11, 2026

OpenAI expands Realtime API with voice translation tools

OpenAI has introduced GPT-Realtime-2, an expansion of its Realtime API. This update includes new translation and transcription models designed to enhance the speed and capability of AI voice agents.

Key Takeaways

GPT-Realtime-2 is an expansion of OpenAI’s Realtime API.
The update adds new translation models for AI voice agents.
OpenAI also added new transcription models.
The release is aimed at faster AI voice agents.
The article describes the new capability as covering translation, transcription, and task handling.

Why It Matters

This expands the Realtime API from voice interaction into more explicit translation and transcription workflows, which matters for teams building AI voice agents inside streaming and media products. The article does not detail pricing, latency gains, or customer availability, so the immediate signal is capability breadth rather than commercial impact. For the broader ecosystem, the relevant change is that OpenAI is adding more functions into a single voice-agent stack. What to watch next is whether OpenAI publishes benchmark data or product access details for GPT-Realtime-2 and the new models.

Read full article at eweek.com

Get this in your inbox → Subscribe

Enjoy our coverage?

Add StreamingMeme as a preferred source on Google to see more of our streaming news at the top of your Search results.

Add as preferred source

NVIDIA: NVIDIA’s Nemotron 3 Nano Omni targets multimodal agent reasoning

YouTube: Google brings AI video editing and Ask YouTube to YouTube

TV Technology: Brightcove adds scene-level ad signals and cue point recommendations

Broadcast: EVS Embeds AI for Deblurring, Player Tracking, and Vertical Reframing

← AI for Video

AI & VideoProduct LaunchMay 11, 2026

OpenAI expands Realtime API with voice translation tools

eWeek

OpenAI has introduced GPT-Realtime-2, an expansion of its Realtime API. This update includes new translation and transcription models designed to enhance the speed and capability of AI voice agents.

Key Takeaways

GPT-Realtime-2 is an expansion of OpenAI’s Realtime API.
The update adds new translation models for AI voice agents.
OpenAI also added new transcription models.
The release is aimed at faster AI voice agents.
The article describes the new capability as covering translation, transcription, and task handling.