Get the latest tech news

Effort – a possibly new algorithm for LLM Inference


A possibly new algorithm for LLM Inference. Adjust smoothly - and in real time - how many calculations you'd like to do during inference.

With Effort you can adjust smoothly - and in real time - how many calculations you'd like to do during inference of an LLM model. The multiplications are fast, but inference overall is still slightly lacking because some non-essential parts - softmax etc - need improvement. The pink line is actual speed, with a suboptimal implementation of the overhead (calculating norms, attn scores etc).

Get the Android app

Or read this on Hacker News

Read more on:

Photo of effort

effort

Photo of new algorithm

new algorithm

Photo of LLM Inference

LLM Inference

Related news:

News photo

New charging algorithm could double life of li-ion batteries | The new algorithm could greatly reduce the ageing effects of continuous recharge cycles

News photo

Intel's effort to build a foundry biz is costing far more – and taking longer – than expected

News photo

Apple Scraps In-House Effort to Make Watch Displays, Cuts Jobs