Navigation
Recherche
|
When AI is trained for treachery, it becomes the perfect agent
lundi 29 septembre 2025, 09:15 , par TheRegister
We’re blind to malicious AI until it hits. We can still open our eyes to stopping it
Opinion Last year, The Register reported on AI sleeper agents. A major academic study explored how to train an LLM to hide destructive behavior from its users, and how to find it before it triggered. The answers were unambiguously asymmetric — the first is easy, the second very difficult. Not what anyone wanted to hear.…
https://go.theregister.com/feed/www.theregister.com/2025/09/29/when_ai_is_trained_for/
Voir aussi |
56 sources (32 en français)
Date Actuelle
mar. 30 sept. - 06:21 CEST
|