Get the latest tech news
OpenAI upgrades its transcription and voice-generating AI models
OpenAI is bringing new transcription and voice-generating AI models to its API that the company claims improve upon its previous releases. For OpenAI, the
Trained on “diverse, high-quality audio datasets,” the new models can better capture accented and varied speech, OpenAI claims, even in chaotic environments. Whisper notoriously tended to fabricate words — and even whole passages — in conversations, introducing everything from racial commentary to imagined medical treatments into transcripts. According to OpenAI’s internal benchmarks, gpt-4o-transcribe, the more accurate of the two transcription models, has a “word error rate” approaching 30% for Indic and Dravidian languages like Tamil, Telugu, Malayalam, and Kannada.
Or read this on TechCrunch