thumbnail Researchers question Anthropic claim that AI-assisted attack was 90% autonomous
thumbnail Many-shot jailbreaking \ Anthropic
thumbnail Anthropic researchers find that AI models can be trained to deceive