Get the latest tech news
Claude Sonnet 4.5 is Anthropic's safest AI model yet
Claude Sonnet 4.5 is here and it's not only Anthropic's best coding model yet, it's also its safest AI system to date too.
For instance, in OSWorld, a suite that tests AI models on real-world computer tasks, Sonnet 4.5 set a record score of 61.4 percent, putting it 17 percentage points above Opus 4.1. That training translates to a chatbot Anthropic says is "substantially" less prone to "sycophancy, deception, power-seeking and the tendency to encourage delusional thinking" — all potential model traits that have landed OpenAI in hot water in recent months. Due to the sophistication of the new model, Anthropic is releasing Sonnet 4.5 under its AI Safety Level 3 framework, meaning it comes with filters designed to prevent potentially dangerous outputs related to prompts around chemical, biological and nuclear weapons.
Or read this on Endgadget