DeepSeek’s new AI model appears to be one of the best ‘open’ challengers yet
In a subset of coding competitions hosted on Codeforces, a platform for programming contests, DeepSeek’s new model outperforms models including Meta’s Llama 3.1 405B, OpenAI’s GPT-4o, and Alibaba’s Qwen 2.5 72B. High-Flyer, the firm behind DeepSeek, builds its own server clusters for model training; one of the most recent reportedly has 10,000 Nvidia A100 GPUs and cost 1 billion yuan (~$138 million). Founded by Liang Wenfeng, a computer science graduate, High-Flyer aims to achieve “superintelligent” AI through its DeepSeek org.
Or read this on TechCrunch