Get the latest tech news

Full LLM training and evaluation toolkit


Everything about the SmolLM & SmolLM2 family of models - GitHub - huggingface/smollm: Everything about the SmolLM & SmolLM2 family of models

Our most powerful model is SmolLM2-1.7B-Instruct, which you can use as an assistant with transformers, trl, or using quantized versions with tools like llama.cpp, MLX, and transformers.js. For lighter applications, you can also use the smaller models SmolLM2-360M and SmolLM2-135M, which are suitable for on-device usage and can be integrated similarly. These tools are designed to run locally on your machine without requiring expensive GPU resources.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of training

training

Photo of LLM

LLM

Photo of evaluation

evaluation

Related news:

News photo

Establishing an etiquette for LLM use on Libera.Chat

News photo

OK, I can partly explain the LLM chess weirdness now

News photo

Bluesky Mocks X, Has ‘No Intention’ of Training AI With User Data