The 'truth serum' for AI: OpenAI’s new method for training models to confess their mistakes


Read this on VentureBeat

Read more on:

OpenAI

new method

mistakes

Related news:

OpenAI turns the screws on chatbots to get them to confess mischief

OpenAI Goes on Defense as Google Gains Ground

Anthropic vs. OpenAI red teaming methods reveal different security priorities for enterprise AI