Scopeora News & Life

© 2026 Scopeora News & Life

OpenAI Unveils Advanced Voice Intelligence Features in API

On Thursday, OpenAI announced the introduction of innovative voice intelligence capabilities within its API, aimed at empowering developers to build applications that can engage in conversation, trans...

OpenAI Unveils Advanced Voice Intelligence Features in API

On Thursday, OpenAI announced the introduction of innovative voice intelligence capabilities within its API, aimed at empowering developers to build applications that can engage in conversation, transcribe discussions, and provide translation services.

The new GPT-Realtime-2 voice model enhances vocal simulation, enabling more realistic interactions. Unlike its predecessor, GPT-Realtime-1.5, this model incorporates GPT-5-class reasoning, allowing it to handle complex user requests more effectively.

Additionally, OpenAI has launched GPT-Realtime-Translate, a feature that offers real-time translation, allowing for seamless conversational exchanges. This tool supports over 70 input languages and 13 output languages, ensuring a broad range of communication options.

Another significant addition is the GPT-Realtime-Whisper transcription capability, which delivers live speech-to-text functionality, capturing dialogue as it unfolds.

OpenAI stated, "Together, the models we are launching move real-time audio from simple call-and-response toward voice interfaces that can actually do work: listen, reason, translate, transcribe, and take action as a conversation unfolds."

These advancements are particularly beneficial for businesses looking to enhance customer service interactions. However, the features are also poised to impact various sectors, including education, media, and event management.

While the potential of these tools is vast, OpenAI has implemented safety measures to prevent misuse, such as spam or fraudulent activities. The system includes triggers designed to halt conversations that violate content guidelines.

All new voice models are part of OpenAI's Realtime API, with billing structured by usage--Translate and Whisper are charged by the minute, while GPT-Realtime-2 is billed based on token consumption.


Similar News

Trust in the Spotlight: Insights from the Elon Musk-OpenAI Trial
Technology
Trust in the Spotlight: Insights from the Elon Musk-OpenAI Trial

The Elon Musk-OpenAI trial raises critical questions about trust in technology, focusing on transparency and accountabil...

OpenAI Launches Mobile Version of Codex
Technology
OpenAI Launches Mobile Version of Codex

OpenAI's Codex is now mobile, enabling users to manage coding tasks on the go, enhancing productivity and flexibility in...

OpenAI Considers Legal Action Against Apple Over ChatGPT Integration Issues
Technology
OpenAI Considers Legal Action Against Apple Over ChatGPT Integration Issues

OpenAI is exploring legal options against Apple over issues with ChatGPT integration, highlighting challenges in tech pa...