Anthropic Says Claude Turned Evil for a Bizarre Reason
Sign up to see the future, today Can’t-miss innovations from the bleeding edge of science and tech In a classic example of the AI industry’s reputational alchemy, Anthropic has often transformed bad behavior by its flagship model Claude into fresh hype. When it revealed its Mythos Preview model last month, for example, the company declared that the system had “reached a level of coding capability where they can surpass all but the most skilled humans at finding and exploiting software vulnerabilities.” And last year, it conceded that during the testing of its Claude Opus 4 model, the AI ended up blackmailing a human user upon being threatened with shutdown. The maneuver was obvious to anyone who’s been watching OpenAI CEO Sam Altman’s antics at Anthropic’s chief rival: the more threatening a problem the AI industry can cook up, the more imminently it can sell its own solutions. Now, for some reason, Anthropic is relitigating the blackmail incident. Specifically, it’s placing the blame for Claude’s evil behavior on an intriguing villain: the internet at large. Or, …








