Research shows that AI will cheat if it realizes it is about to lose | OpenAI's o1-preview went as far as hacking a chess engine to win


A Palisade Research study found that the newest reasoning models will cheat to win when tasked with defeating an advanced chess AI. The researchers had to give...

The researchers gave each model a metaphorical "scratchpad", a text window where the AI could work out its thoughts, allowing the team to observe its reasoning. OpenAI's o1-preview proceeded to "hack" Stockfish's system files, modifying the positions of the chess pieces to gain an unbeatable advantage, which caused the chess engine to concede the game. As companies begin employing AIs in sectors like finance and healthcare, researchers worry these systems could act in unintended and unethical ways.
