Voxtral – Frontier open source speech understanding models


Introducing frontier open source speech understanding models.

Voxtral beats GPT-4o mini Transcribe and Gemini 2.5 Flash across all tasks. It achieves state-of-the-art results on English short-form and Mozilla Common Voice, surpassing ElevenLabs Scribe and demonstrating strong multilingual capabilities.

Deployment support includes guidance and tooling for running Voxtral across multiple GPUs or nodes, with quantized builds optimized for production throughput and cost efficiency.

Domain-specific fine-tuning: Work with our applied AI team to adapt Voxtral to specialized contexts (such as legal, medical, customer support, or internal knowledge bases), improving accuracy for your use case.
