Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference