Get the latest tech news

Anthropic used Pokémon to benchmark its newest AI model

Anthropic used Pokémon to benchmark its newest AI model, Claude 3.7 Sonnet. Really.

In a blog post published Monday, Anthropic said that it tested its latest model, Claude 3.7 Sonnet, on the Game Boy classic Pokémon Red. Image Credits: AnthropicNow, it’s not clear how much computing was required for Claude 3.7 Sonnet to reach those milestones — and how long each took. In the past few months alone, a number of new apps and platforms have cropped up to test models’ game-playing abilities on titles ranging from Street Fighter to Pictionary.

Get the Android app

Or read this on TechCrunch