Skip to content
← All content

Created from a single voice note with Agent Craft

X (Twitter)

Someone sat down with Anthropic's Fable 5 and Mythos on day one and…

X (Twitter) post

Someone sat down with Anthropic's Fable 5 and Mythos on day one and figured out how to break it. Not figuratively. They jailbroke the safeguards. And within a day, the US government had seen enough. Both models pulled. Gone. Think about what that actually means for a second. Anthropic builds what is arguably one of the most powerful AI models we've seen to date. Releases it. And within 24 hours, people had already turned it into something so dangerous that it became a national security concern. Not a product risk. Not a PR headache. A genuine threat to general users and to the country. That's not a small thing. I understand why jailbreakers exist. There's a whole culture around it, people treating safety guardrails like a puzzle to solve. And look, I get the curiosity. I'm curious too. But there's a line between testing the limits of a tool and weaponising it. The people who broke Fable 5 crossed that line fast. What gets me is the speed of it. A single day. That's all it took for something designed with safeguards, with intention, with serious engineering behind it, to become something the government had to intervene on. Anthropic didn't want to pull it. That's years of work, teams of researchers, enormous investment. Nobody builds something and wants to see it disappear overnight. But here we are. And the government did what it had to. At the point where a consumer AI model becomes a risk to national security, there's no version of the story where you leave it running. That decision was right. It had to happen. But it does raise something I think about a lot. The race between building powerful AI and building safe AI is not a comfortable one to watch right now. We're moving fast. Genuinely fast. And the jailbreaking community is moving just as fast in the opposite direction. For every guardrail, there's a community working out how to step around it. That's the reality we're operating in. Luckily, what this situation proves is that oversight is real. The US government isn't asleep. Anthropic wasn't left to handle this alone. The system, messy as it is, responded. That matters. What concerns me more than the takedown itself is what it signals about where we are. If a model can be jailbroken into being one of the most dangerous tools out there in under a day, we haven't solved the alignment problem. We haven't even come close. And the bits and pieces of safety architecture being built right now are clearly not enough on their own. I've said before that AI speeds things up. It absolutely does. But speed without control isn't progress. It's exposure. And the people who broke Fable 5 didn't just put themselves at risk, they put the whole thing at risk. Every researcher working on responsible AI, every company trying to build something useful and safe, every user who just wants a better tool, all of it takes a hit when something like this happens. The model is gone now. What comes next from Anthropic will be built differently, I'm sure of it. The learned-from-it moments in AI development are usually painful ones. This one qualifies. But don't let the takedown be the whole story in your head. The real story is that unchecked capability, in the wrong hands, even for 24 hours, is enough to force a government intervention. That should sit with people who think jailbreaking is just a game. It isn't.

James GoddardJun 14, 2026Published to X — @JamesGodda75737View original ↗

More content from Agent Craft