Get the latest tech news
We got 207 tok/s with Qwen3.5-27B on an RTX 3090
Lucebox optimization hub: hand-tuned LLM inference, built for specific consumer hardware. - Luce-Org/lucebox-hub
None
Or read this on Hacker NewsGet the latest tech news
Lucebox optimization hub: hand-tuned LLM inference, built for specific consumer hardware. - Luce-Org/lucebox-hub
None
Or read this on Hacker NewsRead more on:
Related news: