Get the latest tech news

Cerebras launches Qwen3-235B, achieving 1.5k tokens per second


Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.

This kind of fast inference isn't just nice to have -- it shows us what's possible when AI truly keeps pace with developers,” said Saoud Rizwan, CEO of Cline. With today's launch, Cerebras has significantly expanded its inference offering, providing developers looking for an open alternative to OpenAI and Anthropic with comparable levels of model intelligence and code generation capabilities. Moreover, Cerebras delivers something that no other AI provider in the world—closed or open—can do: instant reasoning speed at over 1,500 tokens per second, increasing developer productivity by an order of magnitude vs. GPU solutions.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Cerebras

Cerebras

Photo of tokens

tokens

Photo of qwen3-235b

qwen3-235b

Related news:

News photo

From tokens to thoughts: How LLMs and humans trade compression for meaning

News photo

Not all tokens are meant to be forgotten

News photo

Cerebras achieves 2,500T/s on Llama 4 Maverick (400B)