AI & VideoProduct LaunchMay 11, 2026

OpenAI adds three realtime voice models with GPT-5-class reasoning

OpenAI has released three new real-time voice models, GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper, for its API. These models incorporate "GPT-5-class reasoning" capabilities.

Key Takeaways

GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper are now available in OpenAI's API.
OpenAI says the new realtime voice models add live reasoning.
The models are described as having GPT-5-class reasoning capabilities.

Why It Matters

OpenAI is putting three realtime voice models into its API with live reasoning, which raises the bar for voice-driven application development right now. For streaming and video products, the relevant signal is that the API now includes dedicated models for realtime voice, translation, and Whisper-style use cases rather than a single general-purpose path. What to watch next: how OpenAI documents the model differences and whether API developers adopt GPT-Realtime-2, GPT-Realtime-Translate, or GPT-Realtime-Whisper for specific production workflows.

Read full article at ghacks.net

Get this in your inbox → Subscribe