Ai2 says its new AI model beats one of DeepSeek’s best


Move over, DeepSeek. Seattle-based nonprofit AI lab Ai2 has released a benchmark-topping model called Tulu3-405B.

A spokesperson for Ai2 told TechCrunch that the lab believes Tulu3-405B “underscores the U.S.’ potential to lead the global development of best-in-class generative AI models.”

“This milestone is a key moment for the future of open AI, reinforcing the U.S.’ position as a leader in competitive, open-source models,” the spokesperson said.

Ai2 claims that on the benchmark PopQA, a set of 14,000 specialized knowledge questions sourced from Wikipedia, Tulu3-405B beat not only DeepSeek V3 and GPT-4o, but also Meta’s Llama 3.1 405B model.

Read the full story on TechCrunch.
