Hugging Face: 5 ways enterprises can slash AI costs without sacrificing performance


Ultimately, model makers and enterprises are focusing on the wrong issue: They should be computing smarter, not harder.

Adopt “nudge theory” in system design: set conservative reasoning budgets, limit always-on generative features and require opt-in for high-cost compute modes. The “canonical example,” Luccioni noted, is adding cutlery to takeout: Having people decide whether they want plastic utensils, rather than automatically including them with every order, can significantly reduce waste.

Instead of chasing the largest GPU clusters, begin with the question: “What is the smartest way to achieve the result?” For many workloads, smarter architectures and better-curated data outperform brute-force scaling.
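As a rough illustration of the opt-in idea, the sketch below shows one way a service could default to a small reasoning budget and unlock a larger one only when the user explicitly asks for it. Every name and number here (`RequestOptions`, `effective_budget`, the 256 and 4096 limits, the notion of a budget measured in reasoning tokens) is a hypothetical assumption for illustration, not something taken from the article or from any real library.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical limits, chosen only for illustration.
DEFAULT_REASONING_BUDGET = 256    # conservative default (assumed unit: reasoning tokens)
MAX_REASONING_BUDGET = 4096       # ceiling reachable only after explicit opt-in


@dataclass(frozen=True)
class RequestOptions:
    """Per-request settings; every costly feature is off unless requested."""
    high_compute_opt_in: bool = False        # the "do you want cutlery?" switch
    requested_budget: Optional[int] = None   # None means "use the default"
    generate_summary: bool = False           # generative extras are not always-on


def effective_budget(opts: RequestOptions) -> int:
    """Return the reasoning budget actually granted to a request."""
    if opts.requested_budget is None:
        return DEFAULT_REASONING_BUDGET
    if opts.requested_budget < 0:
        raise ValueError("requested_budget must be non-negative")
    # Without opt-in, a request can never exceed the conservative default.
    cap = MAX_REASONING_BUDGET if opts.high_compute_opt_in else DEFAULT_REASONING_BUDGET
    return min(opts.requested_budget, cap)
```

With these assumed defaults, a request that sets nothing gets the small budget, and asking for more without opting in is capped at the same level. The design choice is that the expensive path stays available but is never the default.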

Or read this on Venture Beat
