Get the latest tech news

ElevenLabs’ new speech-to-text model Scribe is here with highest accuracy rate so far (96.7% for English)


Scribe’s pricing structure makes it competitive for businesses that require high-volume transcription services with API-based integration.

ElevenLabs, the highly-valued AI voice cloning and generation startup from former Palantir alumni, today launched Scribe v1, a new speech-to-text model that reportedly achieves the highest accuracy across multiple languages. According to the company’s benchmarks, it outperforms Google’s Gemini 2.0 Flash, OpenAI’s Whisper v3, and Deepgram Nova-3 on accurately converting spoken speech into text on the web, achieving new record-low error rates. Timing is everything, and ElevenLabs chose to launch Scribe the same day as rival Hume AI unveiled Octave, an LLM-powered text-to-speech model that allows users to customize AI-generated voices with adjustable emotions.

Get the Android app

Or read this on Venture Beat

Read more on:

Photo of ElevenLabs

ElevenLabs

Photo of Text

Text

Photo of model

model

Related news:

News photo

ElevenLabs is launching its own speech-to-text model

News photo

Apple fixing bug in iPhone speech-to-text glitch which interprets 'racist' as 'Trump'

News photo

ElevenLabs now lets authors create and publish audiobooks on its own platform