Anthropic CEO wants to open the black box of AI models by 2027


Anthropic CEO Dario Amodei set forth a goal for his company to "reliably detect most AI model problems" by 2027.

Anthropic CEO Dario Amodei published an essay Thursday highlighting how little researchers understand about the inner workings of the world’s leading AI models. In “The Urgency of Interpretability,” Amodei writes that Anthropic has made early breakthroughs in tracing how models arrive at their answers, but he emphasizes that far more research is needed to decode these systems as they grow more powerful. Anthropic is one of the pioneering companies in mechanistic interpretability, a field that aims to open the black box of AI models and understand why they make the decisions they do.

Read this on TechCrunch

Related news:

OpenAI no longer considers manipulation and mass disinformation campaigns a risk worth testing for before releasing its AI models

OpenAI launches o3 and o4-mini, AI models that ‘think with images’ and use tools autonomously

Apple details how it plans to improve its AI models by privately analyzing user data