Anthropic's Claude vulnerable to 'emotional manipulation'

Saturday 12 October 2024, 12:30, by TheRegister

AI model safety only goes so far
Anthropic's Claude 3.5 Sonnet, despite its reputation as one of the better-behaved generative AI models, can still be convinced to emit racist hate speech and malware.

Read more on TheRegister

https://go.theregister.com/feed/www.theregister.com/2024/10/12/anthropics_claude_vulnerable_to_emoti...
