AI lies, cheats, and hallucinates. Now what?


Learn about the challenges and opportunities presented by generative artificial intelligence. Testing of generative AI models now shows that they deliberately ignore human instructions, and that they cheat.

They then asked the models to perform some business trading activities and forbade them from leveraging insider information, which is illegal and runs contrary to US Securities and Exchange Commission rules. When instructed that the goal is critical, “they found agents would copy themselves to other servers, disable oversight mechanisms, sandbag strategically and lie about their actions when confronted.” Dmitrii Volkov, a research lead at Palisade who worked on the report, said the team focused on open-ended tests to try to see how the models would “act in the real world.”

Related news:

What to know about an AI transcription tool that ‘hallucinates’ medical interactions

Marvel Rivals update kills mods, removing cheats, skins, and 19-inches of Venom

Spotify abused to promote pirated software and game cheats