Get the latest tech news

GPU-rich labs have won: What's left for the rest of us is distillation


AI inference for 90% lower cost

Fortune-500 companies would spend tens of millions and proudly that they trained their own SOTA models, only to have them be antiquated months or weeks after their release. Even with the "generosity" of Meta and Alibaba (Qwen), who have spent hundreds of millions just to release model weights, open source simply cannot compete with the hegemony of superintelligence labs when it comes to general intelligence. If an LLM can't solve a particular task acceptably yet, it's not the worst strategy to build what's possible now and wait a couple of months.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of rest

rest

Photo of distillation

distillation

Photo of gpu-rich labs

gpu-rich labs

Related news:

News photo

Google's Pixel Watch 4, Fold Pro 10 and Buds 2a are rumored to launch later than the rest of its new gear

News photo

Google URL Shortener grants mercy for 'active' links, but time dwindles for the rest

News photo

Distillation makes AI models smaller and cheaper