Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference
Cerebras Inference now 3x faster: Llama3.1-70B breaks 2,100 tokens/s