new benchmark

In 'Milestone' for Open Source, Meta Releases New Benchmark-Beating Llama 4 Models

Even some of the best AI can’t beat this new benchmark

Google DeepMind researchers introduce new benchmark to improve LLM factuality, reduce hallucinations

A new benchmark for AI investment: Swift Ventures unveils system to separate talk from action

A New Benchmark for the Risks of AI

Can AI really compete with human data scientists? OpenAI’s new benchmark puts it to the test

Open-source AI narrows gap with proprietary leaders, new benchmark reveals

Claude Sonnet 3.5 – The New Benchmark in Conversational AI

Sierra’s new benchmark reveals how well AI agents perform at real work

IQM achieves a new benchmark on 20-qubit quantum computer

Microsoft sets new benchmark in AI data security with Purview upgrades