Two different tricks for fast LLM inference
Anthropic and OpenAI both recently announced “fast mode”: a way to interact with their best coding model at significantly higher speeds. These two versions of fast mode are very different.