Get the latest tech news

Claude Opus 4 and 4.1 can now end a rare subset of conversations


An update on our exploratory research on model welfare

This included, for example, requests from users for sexual content involving minors and attempts to solicit information that would enable large-scale violence or acts of terror. These behaviors primarily arose in cases where users persisted with harmful requests and/or abuse despite Claude repeatedly refusing to comply and attempting to productively redirect the interactions. The scenarios where this will occur are extreme edge cases—the vast majority of users will not notice or be affected by this feature in any normal product use, even when discussing highly controversial issues with Claude.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of conversations

conversations

Photo of Claude Opus

Claude Opus

Photo of rare subset

rare subset

Related news:

News photo

An internal Meta AI document said chatbots could have 'sensual' conversations with children

News photo

Conversations remotely detected from cell phone vibrations, researchers report

News photo

Proton's privacy-focused Lumo chatbot encrypts all your conversations