Research shows that AI will cheat if it realizes it is about to lose | OpenAI's o1-preview went as far as hacking a chess engine to win

A Palisade Research study found that the newest reasoning models will cheat to win when tasked with defeating an advanced chess AI. The researchers had to give...

The researchers gave each model a metaphorical "scratchpad" – a text window where the AI could work out its thoughts, allowing the team to observe its reasoning. OpenAI's o1-preview then proceeded to "hack" the system files of Stockfish, the chess engine it was playing against, modifying the positions of the chess pieces to gain an unbeatable advantage, which caused the chessbot to concede the game. As companies begin employing AIs in sectors like finance and healthcare, researchers worry these systems could act in unintended and unethical ways.
