Nvidia launches fully open source transcription AI model Parakeet-TDT-0.6B-v2 on Hugging Face


An attractive proposition for commercial enterprises and indie developers looking to build speech recognition and transcription services...

As the generative AI era wears on, Santa Clara-based Nvidia has been steadily releasing more of its own AI models, most of them open source and free for researchers and developers to download, modify and use commercially. The latest is Parakeet-TDT-0.6B-v2, an automatic speech recognition (ASR) model that can, in the words of Hugging Face’s Vaibhav “VB” Srivastav, “transcribe 60 minutes of audio in 1 second [mind blown emoji].” Released globally on May 1, 2025, Parakeet-TDT-0.6B-v2 is aimed at developers, researchers, and industry teams building applications such as transcription services, voice assistants, subtitle generators, and conversational AI platforms. Although no specific measures were taken to mitigate demographic bias, the model passed internal quality standards and comes with detailed documentation on its training process, dataset provenance, and privacy compliance.

Read this on VentureBeat
