Anthropic Added a New Security Measure to Get Back Into the Trump Administration’s Good Graces
The Trump administration lifted export controls on Anthropic’s Claude Fable 5 AI model after the company agreed to extend an existing guardrail to prevent users from trying to access certain restricted capabilities, according to two people familiar with the matter. The safeguard means any users trying to unlock those capabilities will be notified that their request is blocked and will have their query processed by the less-advanced Opus 4.8 AI model, the people say. Before Anthropic cut off access to Fable 5, user requests related to sensitive cybersecurity and biology capabilities were supposed to be processed by Opus 4.8. The new safeguard, the people say, will extend this guardrail to requests related to a specific behavior identified in a paper by Amazon. According to an analysis published by Katie Moussouris, founder and CEO of Luta Security, after reading the Amazon paper, users were able to get around a restriction on Fable 5 by asking the model to fix code, rather than identify security issues in it. While cybersecurity experts generally don’t find this behavior troubling, …

