We got 207 tok/s with Qwen3.5-27B on an RTX 3090
Qwen3.5-397B at 4.74 tok/s using 5.9GB RAM
DeepSeek R1 Distill 8B Q40 on 4 x Raspberry Pi 5