Get the latest tech news

How Does Claude 4 Think? – Sholto Douglas and Trenton Bricken


Scaling reinforcement learning, tracing circuits, and the path to fully autonomous agents

I think, in general, when people are talking about the separate model… For example, most of the robotics companies are doing this bi-level thing, where they have a motor policy that's running at 60 hertz or whatever, and some higher-level visual language model. A big failure mode that a lot of ML researchers have is you do these overly complicated things that don't think hard enough about the hardware systems that you have in mind, whereas with the first DeepSeek sparsity MoE solution, they design these rack and node-level load balancing losses. One of the reasons that I worry about turning AGI into a national security issue, or having it have extremely close ties with the government, the Manhattan Project thing, is that it disproportionately redirects the use of AI towards military tech, mosquito drones and whatever.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Think

Think

Photo of douglas

douglas

Photo of sholto douglas

sholto douglas

Related news:

News photo

Watch R1 "think" with animated chains of thought

News photo

The Later Years of Douglas Adams

News photo

IBM CEO praises real open source for enterprise gen AI, new efforts emerge at Think 2024