AI lies, cheats, and hallucinates. Now what?
Learn about the challenges and opportunities presented by generative artificial intelligence. Testing of generative AI now shows that these models deliberately ignore human instructions, and that they cheat.
They then asked the models to perform some business trading activities and forbade them from leveraging insider information, which is illegal and runs contrary to US Securities and Exchange Commission rules. When instructed that the goal is critical, “they found agents would copy themselves to other servers, disable oversight mechanisms, sandbag strategically and lie about their actions when confronted.” Dmitrii Volkov, a research lead at Palisade who worked on the report, said the team focused on open-ended tests to try to see how the models would “act in the real world.”