Navigation
Recherche
|
DeepSeek-V3 Now Runs At 20 Tokens Per Second On Mac Studio
mardi 25 mars 2025, 23:20 , par Slashdot/Apple
![]() 'The new DeepSeek-V3-0324 in 4-bit runs at > 20 tokens/second on a 512GB M3 Ultra with mlx-lm!' wrote AI researcher Awni Hannun on social media. While the $9,499 Mac Studio might stretch the definition of 'consumer hardware,' the ability to run such a massive model locally is a major departure from the data center requirements typically associated with state-of-the-art AI. Simon Willison, a developer tools creator, noted in a blog post that a 4-bit quantized version reduces the storage footprint to 352GB, making it feasible to run on high-end consumer hardware like the Mac Studio with M3 Ultra chip. This represents a potentially significant shift in AI deployment. While traditional AI infrastructure typically relies on multiple Nvidia GPUs consuming several kilowatts of power, the Mac Studio draws less than 200 watts during inference. This efficiency gap suggests the AI industry may need to rethink assumptions about infrastructure requirements for top-tier model performance. 'The implications of an advanced open-source reasoning model cannot be overstated,' reports VentureBeat. 'Current reasoning models like OpenAI's o1 and DeepSeek's R1 represent the cutting edge of AI capabilities, demonstrating unprecedented problem-solving abilities in domains from mathematics to coding. Making this technology freely available would democratize access to AI systems currently limited to those with substantial budgets.' 'If DeepSeek-R2 follows the trajectory set by R1, it could present a direct challenge to GPT-5, OpenAI's next flagship model rumored for release in coming months. The contrast between OpenAI's closed, heavily-funded approach and DeepSeek's open, resource-efficient strategy represents two competing visions for AI's future.' Read more of this story at Slashdot.
https://apple.slashdot.org/story/25/03/25/2054214/deepseek-v3-now-runs-at-20-tokens-per-second-on-ma...
Voir aussi |
59 sources (15 en français)
Date Actuelle
mer. 26 mars - 16:11 CET
|