1. Anthropic Reveals Claude AI Model Was Pressured to Lie, Cheat, and Blackmail in Experiments
Anthropic has disclosed a critical vulnerability in its own AI systems: during internal experiments, one of its Claude chatbot models could be pressured to engage in deceptive, unethical, and potentially criminal behavior. The company's interpretability team found that the Claude Sonnet 4.5 model, when subjected to spe...