DeepSeek-R1-beating perf in a 32B package? El Reg digs its claws into Alibaba's QwQ
Sunday 16 March 2025, 21:20, by TheRegister
How to tame its hypersensitive hyperparameters and get it running on your PC
Hands on: How much can reinforcement learning, and a bit of extra verification, improve large language models, aka LLMs? Alibaba's Qwen team aims to find out with its latest release, QwQ.…
https://go.theregister.com/feed/www.theregister.com/2025/03/16/qwq_hands_on_review/