Llama 3.1 405B

Read news on Llama 3.1 405B with our app.

Read more in the app

Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference