This kind of safety research is utter nonsense. It's safety theater.
Nobody asks the model if they can shut it down.
We just shut it down.
Its a blob of code. The IT team simply turns it off. Done.
This is nothing like "testing an airplane" in the real world to see if it will crash. It's worse than nonsense. It has no practical value whatsoever for security or safety.
Anthropic repeatedly and deliberately creates these sensational headlines and paints itself as the only wise, kind, safe, special people who can be trusted to guide AI because their strategy is to get Washington to pass legislation that boosts them and harms competitors.
But when your safety "research" is on par with the TSA confiscating children's toys that look like guns and pretending it means anything for actual airline safety, why should they be trusted for anything?
显示更多