StreamingMeme

The streaming technology industry news aggregator.

© 2026 StreamingMeme. All rights reserved.
OpenAI adds live voice, translation, and transcription models

OpenAI

OpenAI has added three new audio models to its API: GPT-Realtime-2 for real-time voice intelligence, GPT-Realtime-Translate for live translation across 70+ input and 13 output languages, and GPT-Realtime-Whisper for low-latency streaming speech-to-text transcription. The models are intended to help developers build more natural, intelligent, and responsive voice applications by improving reasoning, context handling, and real-time processing.

Key Takeaways

  • GPT-Realtime-2 is OpenAI’s first voice model with GPT-5-class reasoning, and its context window grows from 32K to 128K to support longer sessions.
  • GPT-Realtime-Translate supports more than 70 input languages and 13 output languages, and OpenAI cites use cases including customer support, cross-border sales, education, events, media, and creator platforms.
  • GPT-Realtime-Whisper is a streaming transcription model that turns speech into text live as the speaker talks.
  • OpenAI says the Realtime API supports EU Data Residency and includes active classifiers that can halt sessions flagged for harmful content.
  • Pricing starts at $32 per 1M audio input tokens for GPT-Realtime-2, $0.034 per minute for GPT-Realtime-Translate, and $0.017 per minute for GPT-Realtime-Whisper.
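
To put the pricing figures above in concrete terms, here is a rough back-of-the-envelope sketch in Python. The three prices come from the article; the function names, the 90-minute event, and the 250,000-token session are hypothetical values chosen for illustration. The article does not say how partial minutes are billed or how many audio tokens a minute of speech produces, so the sketch assumes simple linear pricing and takes the token count as an input rather than deriving it. It also covers only GPT-Realtime-2's audio input price, since no other rates are quoted here.

```python
from decimal import Decimal

# Prices quoted in the article (USD). Anything else in this file is illustrative.
TRANSLATE_PER_MINUTE = Decimal("0.034")              # GPT-Realtime-Translate
WHISPER_PER_MINUTE = Decimal("0.017")                # GPT-Realtime-Whisper
REALTIME2_PER_1M_AUDIO_INPUT_TOKENS = Decimal("32")  # GPT-Realtime-2 starting price


def per_minute_cost(minutes, price_per_minute):
    """Cost of a per-minute model, assuming linear billing with no rounding."""
    return Decimal(str(minutes)) * price_per_minute


def audio_input_token_cost(tokens):
    """Cost of GPT-Realtime-2 audio input for a given token count.

    The token count must come from real usage data; the article does not
    state how many tokens a minute of audio consumes.
    """
    return Decimal(tokens) * REALTIME2_PER_1M_AUDIO_INPUT_TOKENS / Decimal(1_000_000)


if __name__ == "__main__":
    event_minutes = 90          # hypothetical live event length
    session_tokens = 250_000    # hypothetical audio input token count

    print("Translate:", per_minute_cost(event_minutes, TRANSLATE_PER_MINUTE))
    print("Whisper:  ", per_minute_cost(event_minutes, WHISPER_PER_MINUTE))
    print("Realtime-2 audio input:", audio_input_token_cost(session_tokens))
```

Under these assumptions, translating a 90-minute event costs about $3.06 and transcribing it costs about $1.53, so the per-minute models are inexpensive per stream and the total scales with the number of concurrent streams and output languages.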

Why It Matters

OpenAI is pushing real-time audio beyond simple turn-taking toward models that can reason, translate, and transcribe while a conversation is still in progress. That matters for voice interfaces in streaming-adjacent workflows such as captions, live translation, and support, along with the other spoken interactions the company explicitly lists. The signal to the ecosystem is that OpenAI is positioning voice as a production API layer rather than a demo feature, citing named examples from Zillow, Deutsche Telekom, Priceline, Vimeo, and BolnaAI. What to watch next: whether developers adopt GPT-Realtime-2’s higher-context and adjustable reasoning modes in shipped products, not just in the Playground.


Read full article at openai.com
