Get the latest tech news

OpenAI is training models to 'confess' when they lie - what it means for future AI

A new study made a version of GPT-5 Thinking admit its own misbehavior. But it's not a quick fix for bigger safety issues.

None

Related news:

OpenAI, NextDC Plan to Build $4.6 Billion Sydney Data Center

The 'truth serum' for AI: OpenAI’s new method for training models to confess their mistakes

OpenAI turns the screws on chatbots to get them to confess mischief