Get the latest tech news

Anthropic used Pokémon to benchmark its newest AI model


Anthropic used Pokémon to benchmark its newest AI model, Claude 3.7 Sonnet. Really.

In a blog post published Monday, Anthropic said that it tested its latest model, Claude 3.7 Sonnet, on the Game Boy classic Pokémon Red. Image Credits: AnthropicNow, it’s not clear how much computing was required for Claude 3.7 Sonnet to reach those milestones — and how long each took. In the past few months alone, a number of new apps and platforms have cropped up to test models’ game-playing abilities on titles ranging from Street Fighter to Pictionary.

Get the Android app

Or read this on TechCrunch

Read more on:

Photo of Pokémon

Pokémon

Photo of Anthropic

Anthropic

Photo of newest AI model

newest AI model

Related news:

News photo

Anthropic Launches the World’s First ‘Hybrid Reasoning’ AI Model

News photo

Anthropic’s New AI Model Lets Users Decide How Much It Reasons

News photo

Pokémon wins "substantial" damages and public apology in long-running battle against Chinese clone game