Get the latest tech news

How a Seemingly Harmless Image Can Jailbreak Vision-Language AI Models


Slashdot reader BrianFagioli writes: Florida International University researchers have developed a technique called JaiLIP (Jailbreaking with Loss-guided Image Perturbation) that uses subtle image modifications to bypass AI safety guardrails. Unlike traditional jailbreaks that rely on carefully craf...

None

Get the Android app

Or read this on Slashdot

Read more on:

Photo of Models

Models

Photo of language

language

Photo of harmless image

harmless image

Related news:

News photo

The third Xbox price hike in 15 months raises all models by at least $100

News photo

Asian AI startups launch Mythos-like models as Anthropic’s export ban drags on

News photo

OpenAI unveils GPT-5.6 Sol, Terra and Luna models — but only accessible to limited preview partners for now, per US Gov