Get the latest tech news

AI benchmarks are a bad joke – and LLM makers are the ones laughing

None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of ones

ones

Photo of LLM

LLM

Photo of AI benchmarks

AI benchmarks

Related news:

Can you save on LLM tokens using images instead of text?

AI benchmarks are a bad joke – and LLM makers are the ones laughing

Agent-o-rama: build, trace, evaluate, and monitor LLM agents in Java or Clojure

« Why hasn't there been a new major sports league?

Things I've Heard Boomers Say That I Agree with 100% »