AI benchmarks are a bad joke – and LLM makers are the ones laughing



Related news:

Can you save on LLM tokens using images instead of text?

Agent-o-rama: build, trace, evaluate, and monitor LLM agents in Java or Clojure