Get the latest tech news
ElevenLabs’ new speech-to-text model Scribe is here with highest accuracy rate so far (96.7% for English)
Scribe’s pricing structure makes it competitive for businesses that require high-volume transcription services with API-based integration.
ElevenLabs, the highly-valued AI voice cloning and generation startup from former Palantir alumni, today launched Scribe v1, a new speech-to-text model that reportedly achieves the highest accuracy across multiple languages. According to the company’s benchmarks, it outperforms Google’s Gemini 2.0 Flash, OpenAI’s Whisper v3, and Deepgram Nova-3 on accurately converting spoken speech into text on the web, achieving new record-low error rates. Timing is everything, and ElevenLabs chose to launch Scribe the same day as rival Hume AI unveiled Octave, an LLM-powered text-to-speech model that allows users to customize AI-generated voices with adjustable emotions.
Or read this on Venture Beat