Get the latest tech news

Former DeepSeeker and collaborators release new method for training reliable AI agents: RAGEN


RAGEN stands out not just as a technical contribution but as a conceptual step toward more autonomous, reasoning-capable AI agents.

Unlike static tasks like math solving or code generation, RAGEN focuses on multi-turn, interactive settings where agents must adapt, remember, and reason in the face of uncertainty. Built on a custom RL framework called StarPO (State-Thinking-Actions-Reward Policy Optimization), the system explores how LLMs can learn through experience rather than memorization. As AI continues to move toward autonomy, projects like RAGEN help illuminate what it takes to train models that learn not just from data, but from the consequences of their own actions.

Get the Android app

Or read this on Venture Beat

Read more on:

Photo of new method

new method

Photo of reliable AI agents

reliable AI agents

Photo of RAGEN

RAGEN

Related news:

News photo

New method lets DeepSeek and other models answer ‘sensitive’ questions

News photo

Electricity from rainwater: New method shows promise | In tests, the method was able to power up 12 LED lights.

News photo

Skullcandy’s new Method 360 ANC earbuds have been tuned by Bose