Study Accuses LM Arena of Helping Top AI Labs Game Its Benchmark
Thursday, May 1, 2025, 15:00, by Slashdot
According to the authors, LM Arena allowed some industry-leading AI companies, including Meta, OpenAI, Google, and Amazon, to privately test several variants of their AI models and withhold the scores of the lowest performers. This made it easier for these companies to achieve a top spot on the platform's leaderboard, though the opportunity was not afforded to every firm, the authors say. 'Only a handful of [companies] were told that this private testing was available, and the amount of private testing that some [companies] received is just so much more than others,' said Cohere's VP of AI research and co-author of the study, Sara Hooker, in an interview with TechCrunch. 'This is gamification.' Further reading: Meta Got Caught Gaming AI Benchmarks.
https://slashdot.org/story/25/05/01/0525208/study-accuses-lm-arena-of-helping-top-ai-labs-game-its-b...