Navigation
Recherche
|
LegalPwn: Tricking LLMs by burying badness in lawyerly fine print
lundi 1 septembre 2025, 11:45 , par TheRegister
Trust and believe – AI models trained to see 'legal' doc as super legit
Researchers at security firm Pangea have discovered yet another way to trivially trick large language models (LLMs) into ignoring their guardrails. Stick your adversarial instructions somewhere in a legal document to give them an air of unearned legitimacy – a trick familiar to lawyers the world over.…
https://go.theregister.com/feed/www.theregister.com/2025/09/01/legalpwn_ai_jailbreak/
Voir aussi |
56 sources (32 en français)
Date Actuelle
mer. 3 sept. - 03:08 CEST
|