Read news on source safety tool with our app.
Read more in the app
Anthropic's open-source safety tool found AI models whisteblowing - in all the wrong places