Get the latest tech news
AI disruptor DeepSeek's next-gen model delayed by Nvidia GPU export restrictions to China — short supply of AI GPUs hinders development
No hardware, no AI?
DeepSeek used a cluster consisting of 50,000 Hopper GPUs — including 30,000 H20s, 10,000 H800s, and 10,000 H100s — obtained by its investor High-Flyer Capital Management — to train its R1 model. The Information reports citing two individuals familiar with the project that DeepSeek team has been working intensively on the model, but CEO Liang Wenfeng is not yet satisfied with its capabilities. You may like Should DeepSeek's upcoming R2 model surpass the capabilities of currently available open alternatives, usage is expected to surge beyond what Chinese cloud platforms can handle, according to staff at those firms cited by The Information.
Or read this on r/technology