Get the latest tech news

How Sakana AI’s new evolutionary algorithm builds powerful AI models without expensive retraining


M2N2 is a model merging technique that creates powerful multi-skilled agents without the high cost and data needs of retraining.

For enterprises looking to build custom AI solutions, the approach offers a powerful and efficient way to create specialized models by combining the strengths of existing open-source variants. In comments to VentureBeat, the paper’s authors said model merging is a gradient-free process that only requires forward passes, making it computationally cheaper than fine-tuning, which involves costly gradient updates. The results showed that its diversity-preservation mechanism was key, allowing it to maintain an archive of models with complementary strengths that facilitated effective merging while systematically discarding weaker solutions.

Get the Android app

Or read this on Venture Beat

Read more on:

Photo of Sakana AI’s

Sakana AI’s

Photo of powerful AI models

powerful AI models

Photo of expensive retraining

expensive retraining

Related news:

News photo

Sakana AI’s TreeQuest: Deploy multi-model teams that outperform individual LLMs by 30%

News photo

Google opens its most powerful AI models to everyone, the next stage in its virtual agent push

News photo

Sakana AI’s CycleQD outperforms traditional fine-tuning methods for multi-skill language models