RTX4090

Read news on RTX4090 with our app.

Read more in the app

SEQUOIA: Exact Llama2-70B on an RTX4090 with half-second per-token latency