Get the latest tech news

Major AI models are easily jailbroken and manipulated, new report finds


Easily jailbroken models show safeguards are failing.

Researchers used prompts in line with industry standard benchmark testing, but found that some AI models didn't even need jailbreaking in order to produce out-of-line responses. The investigation also assessed the capabilities of LLM agents, or AI models used to perform specific tasks, to conduct basic cyber attack techniques. On May 18, OpenAI CEO Sam Altman and president and co-founder Greg Brockman responded to the resignations and growing public concern, writing, "We have been putting in place the foundations needed for safe deployment of increasingly capable systems.

Get the Android app

Or read this on Mashable

Read more on:

Photo of new report

new report

Photo of Major AI models

Major AI models

Related news:

News photo

Call of Duty will come to Xbox Game Pass, says new report

News photo

NASA’s Orion Capsule Heat Shield Wore Away in More Than 100 Places During 2022 Test Flight, Posing ‘Significant Risks’ A new report highlights safety issues that NASA must address before using the spacecraft to send astronauts to the moon, and the agency is already working on fixing the problems.

News photo

iOS 18's Rumored AI Features for Siri, Spotlight, and More Revealed in New Report