Get the latest tech news

No retraining needed: Sakana’s new AI model changes how machines learn


Sakana found that self-adaptive models can modify their weights during inference to adjust behavior to new and unseen tasks.

This is the latest in a series of techniques that aim to improve the abilities of large language models(LLMs) at inference time, making them increasingly useful for everyday applications across different domains. “By selectively adjusting critical components of the model weights, our framework allows LLMs to dynamically adapt to new tasks in real time,” the researchers write in a blog post published on the company’s website. Titans, an architecture developed by researchers at Google, tackles the problem from a different angle, giving language models the ability to learn and memorize new information at inference time.

Get the Android app

Or read this on Venture Beat

Read more on:

Photo of Machines

Machines

Photo of new AI model

new AI model

Photo of sakana

sakana

Related news:

News photo

The Complete Text of "All Watched over by Machines of Loving Grace"

News photo

Show HN: An SSH based utility to transfer pipes across machines – beam

News photo

"The PSP was one of the first machines that had the hardware to allow you to play 'proper console games' on the go": Developers celebrate 20 years of Sony's handheld