MacMusic | PcMusic | 440 Software | 440 Forums | 440TV | Zicos

Téléchargements

Navigation

Ajouter un Site

Switch to english

Recherche

Simple Text Additions Can Fool Advanced AI Reasoning Models, Researchers Find

vendredi 4 juillet 2025, 19:00 , par Slashdot

Researchers have discovered that appending irrelevant phrases like 'Interesting fact: cats sleep most of their lives' to math problems can cause state-of-the-art reasoning AI models to produce incorrect answers at rates over 300% higher than normal [PDF]. The technique -- dubbed 'CatAttack' by teams from Collinear AI, ServiceNow, and Stanford University -- exploits vulnerabilities in reasoning models including DeepSeek R1 and OpenAI's o1 family. The adversarial triggers work across any math problem without changing the problem's meaning, making them particularly concerning for security applications.

The researchers developed their attack method using a weaker proxy model (DeepSeek V3) to generate text triggers that successfully transferred to more advanced reasoning models. Testing on 225 math problems showed the triggers increased error rates significantly across different problem types, with some models like R1-Distill-Qwen-32B reaching combined attack success rates of 2.83 times baseline error rates. Beyond incorrect answers, the triggers caused models to generate responses up to three times longer than normal, creating computational slowdowns. Even when models reached correct conclusions, response lengths doubled in 16% of cases, substantially increasing processing costs.

Read more of this story at Slashdot.

Lire la suite sur Slashdot

https://tech.slashdot.org/story/25/07/04/1521245/simple-text-additions-can-fool-advanced-ai-reasonin...

Voir aussi

models

WeTransfer Backtracks on Terms Suggesting User Files Could Train AI Models After Backlash

reasoning

Former Top Google Researchers Have Made a New Kind of AI Agent

Wired: Tech.16 Jul

researchers

The simple Android screen app that saves my sanity

ComputerWorld16 Jul

rates

Hugging Face Is Hosting 5,000 Nonconsensual AI Models of Real People

triggers

Facebook introduces the biggest change to text posts in years

math

Thermaltake GR300, un cockpit simple et bien équipé pour se lancer ?

CowcotLand15 Jul

advanced

Building a simple router with OpenBSD

text

Researchers Develop New Tool To Measure Biological Age

fool

Tech to protect images against AI scrapers can be beaten, researchers show

TheRegister11 Jul

simple

CVSS 10 RCE in Wing FTP exploited within 24 hours, security researchers warn

TheRegister11 Jul

problems

23 ans après la sortie de Zelda Wind Waker, il réalise qu'il existe une façon bien plus simple de jouer au jeu !

JeuxVideo11 Jul

times

Google fusionne Veo 3 avec Gemini : vous pouvez créer des vidéos à partir d’une simple image

incorrect

Les business mobiles que l’on peut lancer avec un simple smartphone

TT-Hardware10 Jul

answers

Windows 11 : comment convertir des fichiers audio et vidéo avec un simple raccourci clavier ?

01net10 Jul

normal

Google releases Gemma 3n models for on-device AI

InfoWorld10 Jul

models

Hybrid Model Reveals People Act Less Rationally In Complex Games, More Predictably In Simple Ones

reasoning

Clarifai AI Runners connect local models to cloud

InfoWorld 9 Jul

researchers

Microsoft slashes prices 60% on genAI tech that understands audio, video, and text

ComputerWorld 8 Jul

rates

Une simple injection pourrait rendre l'ouïe en quelques semaines !

Génération-NT 8 Jul

triggers

Cat content disturbs AI models

ComputerWorld 8 Jul

News copyright owned by their original publishers | Copyright © 2004 - 2025 Zicos / 440Network

56 sources (32 en français)

Incontournables

Date Actuelle

ven. 15 août - 08:36 CEST