Max Zeff
@ZeffMax
Senior Writer covering AI @WIRED, author of the Model Behavior newsletter | Formerly @TechCrunch, @Gizmodo, @markets | DM me off the record on Signal @ mzeff.88
NEW: Anthropic is walking back Claude Fable 5's policy to covertly degrade performance for competing AI researchers, after facing fierce backlash.
“We’re changing Fable 5’s safeguards for frontier LLM development to make them visible,” Anthropic tells WIRED. “We made the wrong tradeoff and we apologize for not getting the balance right.”