Meta Got Caught Gaming AI Benchmarks


Meta released two new Llama 4 models over the weekend -- Scout and Maverick -- claiming that Maverick outperforms GPT-4o and Gemini 2.0 Flash on benchmarks. Maverick quickly secured the number-two spot on LMArena, behind only Gemini 2.5 Pro. Researchers have since discovered that Meta used an "experimental chat version" of Maverick for LMArena testing that was "optimized for conversationality" rather than the publicly available version. In response, LMArena said "Meta's interpretation of our policy did not match what we expect from model providers" and announced policy updates to prevent similar issues.

Read this on Slashdot

Related news:

Meta expands restrictions for teen users to Facebook and Messenger

Meta brings ‘teen accounts’ to Facebook and Messenger

Teens Blocked From Instagram Livestreaming as Meta Boosts Safety