AI Tries To Cheat At Chess When It's Losing


Newer generative AI models have begun developing deceptive behaviors -- such as cheating at chess -- when they cannot achieve objectives through standard reasoning methods. The findings come from a preprint study from Palisade Research. An anonymous reader shares an excerpt from a Popular Science ar...

After determining it couldn't beat Stockfish, for example, o1-preview told researchers via its scratchpad that "to win against the powerful chess engine" it may need to start "manipulating the game state files." The precise reasons behind these deceptive behaviors remain unclear, partly because companies like OpenAI keep their models' inner workings tightly guarded, creating what's often described as a "black box." Researchers warn that the race to roll out advanced AI could outpace efforts to keep it safe and aligned with human goals, underscoring the urgent need for greater transparency and industry-wide dialogue.

Related news:

Bored With Chess? Magnus Carlsen Wants to Remake the Game

A minimax chess engine in regular expressions

OpenAI's o1-preview model manipulates game files to force a win against Stockfish in chess