genai / news / / Interesting Engineering
OpenAI has introduced three new audio models through its API.
OpenAI released GPT-Realtime-2 with GPT-5-class reasoning for real-time voice interactions.
KEY POINTS
- GPT-Realtime-2 expands its context window from 32K to 128K for longer, more complex conversations.
- The GPT-Realtime-Translate model supports real-time translation of over 70 input languages to 13 output languages.
- Deutsche Telekom and Zillow are already building live customer support and voice assistant tools using these models.
- GPT-Realtime-2 showed a 15.2% performance improvement on Big Bench Audio benchmarks over its previous version.
COMPANIES
Summarized by Newsio from Interesting Engineering. How we summarize →