Get the latest tech news

DeepSeek’s new AI model appears to be one of the best ‘open’ challengers yet


ChCC

In a subset of coding competitions hosted on Codeforces, a platform for programming contests, DeepSeek outperforms models including Meta’s Llama 3.1 405B, OpenAI’s GPT-4o, and Alibaba’s Qwen 2.5 72B. High-Flyer builds its own server clusters for model training, one of the most recent of which reportedly has 10,000 Nvidia A100 GPUs and cost 1 billion yen (~$138 million). Founded by Liang Wenfeng, a computer science graduate, High-Flyer aims to achieve “superintelligent” AI through its DeepSeek org.

Get the Android app

Or read this on TechCrunch

Read more on:

Photo of challengers

challengers

Photo of new AI model

new AI model

Photo of DeepSeek

DeepSeek

Related news:

News photo

Learn how GE Healthcare used AWS to build a new AI model that interprets MRIs

News photo

Gmail now uses a new AI model to try and fend off holiday scams

News photo

Writer’s new AI model aims to fix the ‘sameness problem’ in generative content