genai / news / 2026-05-07 / Interesting Engineering

OpenAI has introduced three new audio models through its API.

OpenAI released GPT-Realtime-2 with GPT-5-class reasoning for real-time voice interactions.

KEY POINTS

GPT-Realtime-2 expands its context window from 32K to 128K for longer, more complex conversations.
The GPT-Realtime-Translate model supports real-time translation of over 70 input languages to 13 output languages.
Deutsche Telekom and Zillow are already building live customer support and voice assistant tools using these models.
GPT-Realtime-2 showed a 15.2% performance improvement on Big Bench Audio benchmarks over its previous version.

COMPANIES

Share X LinkedIn

Summarized by Newsio from Interesting Engineering. How we summarize →

Apple announced new AI features for its Photos app on Monday.