Get the latest tech news

Disagreement among frontier LLMs on real-world fact-checks


67% of real-world claims expose disagreement among the five top frontier LLMs. Methodology, breakdowns, and data CSV.

None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of frontier LLMs

frontier LLMs

Photo of check claims

check claims