Alibaba Cloud Says It Cut Nvidia AI GPU Use By 82% With New Pooling System
Tuesday, October 21, 2025, 12:00, by Slashdot
The system was tested in production over several months, according to the paper, which lists authors from both Peking University and Alibaba's infrastructure division, including CTO Jingren Zhou. During that window, the number of GPUs needed to support dozens of different LLMs -- ranging in size up to 72 billion parameters -- fell from 1,192 to just 213, a reduction of about 82%. While the paper does not break down which models contributed most to the savings, reporting by the South China Morning Post says the tests were conducted using Nvidia's H20, one of the few accelerators still legally available to Chinese buyers under current U.S. export controls. Read more of this story at Slashdot.
https://hardware.slashdot.org/story/25/10/21/005243/alibaba-cloud-says-it-cut-nvidia-ai-gpu-use-by-8...