OpenAI's new confession system teaches models to be honest about bad behaviors
AI models may be accidentally (and secretly) learning each other’s bad behaviors