Get the latest tech news

ElevenLabs is launching its own speech-to-text model


ElevenLabs, an AI startup that just raised a $180 million mega funding round, has been primarily known for its audio generation prowess. The company took

This list includes English (claimed accuracy rate of 97%), French, German, Hindi, Indonesian, Japanese, Kannada, Malayalam, Polish, Portuguese, Spanish, and Vietnamese. The company said that the model outperformed Google Gemini 2.0 Flash and Whisper Large V3 across multiple languages in FLEURS & Common Voice benchmark tests. The model also has smart speaker diarization to tell you who is speaking, timestamp at word level for accurate subtitles, and auto-tagging sound events like audience laughters.

Get the Android app

Or read this on TechCrunch

Read more on:

Photo of ElevenLabs

ElevenLabs

Photo of Text

Text

Photo of model

model

Related news:

News photo

Hume launches new text-to-speech model Octave that generates custom AI voices with adjustable emotions

News photo

Apple fixing bug in iPhone speech-to-text glitch which interprets 'racist' as 'Trump'

News photo

ElevenLabs now lets authors create and publish audiobooks on its own platform