Anthropic used Pokémon to benchmark its newest AI model
Anthropic used Pokémon to benchmark its newest AI model, Claude 3.7 Sonnet. Really.
In a blog post published Monday, Anthropic said that it tested its latest model, Claude 3.7 Sonnet, on the Game Boy classic Pokémon Red.
Now, it’s not clear how much computing was required for Claude 3.7 Sonnet to reach those milestones, or how long each took.
In the past few months alone, a number of new apps and platforms have cropped up to test models’ game-playing abilities on titles ranging from Street Fighter to Pictionary.
Or read this on TechCrunch