Cerebras Inference

Read news on Cerebras Inference with our app.

Read more in the app

Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference

Cerebras Inference now 3x faster: Llama3.1-70B breaks 2,100 tokens/s