Get the latest tech news

DeepSeek-V3 Now Runs At 20 Tokens Per Second On Mac Studio


An anonymous reader quotes a report from VentureBeat: Chinese AI startup DeepSeek has quietly released a new large language model that's already sending ripples through the artificial intelligence industry -- not just for its capabilities, but for how it's being deployed. The 641-gigabyte model, dub...

An anonymous reader quotes a report from VentureBeat: Chinese AI startup DeepSeek has quietly released a new large language model that's already sending ripples through the artificial intelligence industry -- not just for its capabilities, but for how it's being deployed. The 641-gigabyte model, dubbed DeepSeek-V3-0324, appeared on AI repository Hugging Face today with virtually no announcement (just an empty README file), continuing the company's pattern of low-key but impactful releases. [...] Simon Willison, a developer tools creator, noted in a blog post that a 4-bit quantized version reduces the storage footprint to 352GB, making it feasible to run on high-end consumer hardware like the Mac Studio with M3 Ultra chip.

Get the Android app

Or read this on Slashdot

Read more on:

Photo of Mac

Mac

Photo of mac studio

mac studio

Photo of tokens

tokens

Related news:

News photo

DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that’s a nightmare for OpenAI

News photo

Amazon Spring Sale Apple deals include the Mac mini M4 for a record-low price

News photo

Locks, leases, fencing tokens, FizzBee