Get the latest tech news

OpenAI O3 breakthrough high score on ARC-AGI-PUB


OpenAI o3 scores 75.7% on ARC-AGI public leaderboard.

The reason why solving a single ARC-AGI task can end up taking up tens of millions of tokens and cost thousands of dollars is because this search process has to explore an enormous number of paths through program space – including backtracking. Regardless, the current performance represents a remarkable achievement, and a clear confirmation that intuition-guided test-time search over program space is a powerful paradigm to build AI systems that can adapt to arbitrary tasks. o3 fixes the fundamental limitation of the LLM paradigm – the inability to recombine knowledge at test time – and it does so via a form of LLM-guided natural language program search.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of ARC-AGI-PUB

ARC-AGI-PUB

Photo of high score

high score

Photo of OpenAI O3

OpenAI O3