Get the latest tech news

Mlx-community/OLMo-2-0325-32B-Instruct-4bit


OLMo 2 32B [claims to be](https://simonwillison.net/2025/Mar/13/ai2/) "the first fully-open model (all data, code, weights, and details are freely available) to outperform GPT3.5-Turbo and GPT-4o mini". Thanks to the MLX project …

That downloads 17GB to~/.cache/huggingface/hub/models--mlx-community--OLMo-2-0325-32B-Instruct-4bit. To start an interactive chat with OLMo 2: The-o unlimited 1 removes the cap on the number of output tokens - the default for llm-mlx is 1024 which isn't enough to attempt to draw a pelican.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of community

community

Photo of OLMo-2

OLMo-2

Photo of Instruct-4bit

Instruct-4bit

Related news:

News photo

Vancouver’s last few pay phones still serve community as a 'lifeline'

News photo

Meta plans to test and tinker with X’s community notes algorithm | X algorithm will link community notes across Facebook, Instagram, and Threads.

News photo

Meta Plans To Test and Tinker With X's Community Notes Algorithm