Hugging Face claims its new AI models are the smallest of their kind


A team at dev platform Hugging Face has released what they're claiming are the smallest AI models that can analyze images, videos, and text.

The two models, SmolVLM-256M and SmolVLM-500M, can perform tasks like describing images or video clips and answering questions about PDFs and the elements within them, including scanned text and charts. To train them, the Hugging Face team used The Cauldron, a collection of 50 “high-quality” image and text datasets, and Docmatix, a set of file scans paired with detailed captions. [...] The researchers speculated that this could be because smaller models recognize surface-level patterns in data, but struggle to apply that knowledge in new contexts.

Read the full story on TechCrunch
