Navigation
Recherche
|
Intel, Ampere show running LLMs on CPUs isn't as crazy as it sounds
mercredi 1 mai 2024, 13:24 , par TheRegister
If you lower you expectations, of course. Think more Llama2-7B, less GPT-4
Popular generative AI chatbots and services like ChatGPT or Gemini mostly run on GPUs or other dedicated accelerators, but as smaller models are more widely deployed in the enterprise, CPU-makers Intel and Ampere are suggesting their wares can do the job too – and their arguments aren't entirely without merit.…
https://go.theregister.com/feed/www.theregister.com/2024/05/01/intel_ampere_show_running_llms/
|
56 sources (32 en français)
Date Actuelle
dim. 24 nov. - 04:12 CET
|