Get the latest tech news

Prompt Injection as Role Confusion

LLMs can't tell who's speaking. We show they identify roles by writing style, not tags, and exploit this with CoT Forgery, injecting fake reasoning that models mistake for their own thoughts.

None

Get the Android app

Or read this on Hacker News

Related news:

The theory taking the rich by storm: China funds data center haters

Introducing Boron Buckyballs: Theory that B80 cages can’t be made is disproved

The Kaiser and a "Mediocre Man" Theory of History

« Valve says Steam Machine's price is "significantly more" than it originally envisaged, and the launch quantity is "less than we wanted to be able to make"

Valve's new Steam Machines might be modestly specced but FSR 4 support is a big win for the tiny device »