Less is more: How ‘chain of draft’ could cut AI costs by 90% while improving performance

Zoom researchers unveil "chain of draft," which cuts AI token usage by 92%, transforming the economics of language model deployment.

“When solving complex tasks — whether mathematical problems, drafting essays or coding — we often jot down only the critical pieces of information that help us progress,” the researchers explain. As companies increasingly integrate sophisticated AI systems into their operations, computational costs and response times have emerged as significant barriers to widespread adoption. The technique could prove especially valuable for latency-sensitive applications like real-time customer support, mobile AI, educational tools and financial services, where even small delays can significantly impact user experience.
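To make the cost claim concrete, here is a rough back-of-the-envelope sketch of what a 92% cut in generated tokens could mean for a deployment bill. Only the 92% figure comes from the article. The baseline response length, the request volume and the per-token price are hypothetical placeholders, and the sketch assumes cost scales linearly with output tokens, which may not match a given provider's pricing.

```python
def output_token_cost(tokens_per_response: int, responses: int,
                      usd_per_million_tokens: float) -> float:
    """Return the USD cost of generating the given volume of output tokens."""
    return tokens_per_response * responses * usd_per_million_tokens / 1_000_000


REDUCTION = 0.92          # token reduction reported in the article
BASELINE_TOKENS = 200     # hypothetical: tokens per verbose reasoning response
RESPONSES = 1_000_000     # hypothetical: responses served per month
PRICE = 10.0              # hypothetical: USD per million output tokens

# Tokens per response after applying the reported reduction.
draft_tokens = round(BASELINE_TOKENS * (1 - REDUCTION))

baseline_cost = output_token_cost(BASELINE_TOKENS, RESPONSES, PRICE)
draft_cost = output_token_cost(draft_tokens, RESPONSES, PRICE)

print(f"Tokens per response: {BASELINE_TOKENS} -> {draft_tokens}")
print(f"Monthly cost (USD): {baseline_cost:.2f} -> {draft_cost:.2f}")
print(f"Monthly savings (USD): {baseline_cost - draft_cost:.2f}")
```

Since generation time also tends to grow with the number of tokens produced, the same reduction would plausibly shorten response times, which is why the latency-sensitive applications above stand to benefit.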

Read this on VentureBeat