Cyberveille
curated by Decio
Nuage de tags
Mur d'images
Quotidien
Rechercher
Flux RSS
Flux RSS
Daily Feed
Weekly Feed
Monthly Feed
tags
search
Researchers question Anthropic claim that AI-assisted attack was 90% autonomous
Many-shot jailbreaking \ Anthropic
Anthropic researchers find that AI models can be trained to deceive