Get the latest tech news

AI researchers trick chatbots into sharing how to make cocaine as long as they believe a user is wearing a green shirt — 'CoT Forgery' exploit spurs LLMs to divulge forbidden info by faking trusted chains of thought


Researchers say models judge a prompt’s authority by how it sounds, not where it comes from.

None

Get the Android app

Or read this on r/technology

Read more on:

Photo of Chatbots

Chatbots

Photo of Exploit

Exploit

Photo of LLMs

LLMs

Related news:

News photo

Words Are a Byproduct of Consciousness. For LLMs, It's Backwards

News photo

Security researchers tricked LLMs into giving them cocaine recipes by abusing role models for prompt injection

News photo

Google reportedly capped Meta's use of Gemini AI for coding and chatbots