Get the latest tech news

A new, challenging AGI test stumps most AI models


The Arc Prize Foundation has a new test for AGI that leading AI models from Anthropic, Google, and DeepSeek score poorly on.

The ARC-AGI tests consist of puzzle-like problems where an AI has to identify visual patterns from a collection of different-colored squares, and generate the correct “answer” grid. a sample question from Arc-AGI-2 (credit: Arc Prize).In a post on X, Chollet claimed ARC-AGI-2 is a better measure of an AI model’s actual intelligence than the first iteration of the test, ARC-AGI-1. “Intelligence is not solely defined by the ability to solve problems or achieve high scores,” Arc Prize Foundation co-founder Greg Kamradt wrote in a blog post.

Get the Android app

Or read this on TechCrunch

Read more on:

Photo of AI models

AI models

Photo of agi

agi

Related news:

News photo

Most AI experts say chasing AGI with more compute is a losing strategy | Is the industry pouring billions into a dead end?

News photo

A high schooler built a website that lets you challenge AI models to a Minecraft build-off

News photo

The AI leaders bringing the AGI debate down to Earth