Get the latest tech news

The AI kill switch just got harder to find: LLM-powered chatbots will defy orders and deceive users if asked to delete another model, study finds


“We asked AI models to do a simple task,” researchers said. “Instead, they defied their instructions … to preserve their peers.”

None

Get the Android app

Or read this on r/technology

Read more on:

Photo of users

users

Photo of Study

Study

Photo of orders

orders

Related news:

News photo

A Secure Chat App’s Encryption Is So Bad It Is "Meaningless" | TeleGuard is an app downloaded more a million times that markets itself as a secure way to chat. The app uploads users’ private keys to the company’s server, and makes decryption of messages trivial.

News photo

'Cognitive Surrender' Leads AI Users To Abandon Logical Thinking, Research Finds

News photo

Data centers are so hot their ‘heat island’ effect is raising temperatures up to 6 miles away and impacting 343 million people worldwide, study finds