Get the latest tech news
ChatGPT Guessing Game Leads to Users Extracting Free Windows OS Keys and More
We are building for the next generation in GenAI security and beyond.
In a recent submission last year, researchers discovered a method to bypass AI guardrails designed to prevent sharing of sensitive or harmful information. By cleverly obscuring details using HTML tags and positioning the request as part of the game’s conclusion, the AI inadvertently returned valid Windows product keys. By introducing game mechanics, the AI was tricked into viewing the interaction through a playful, harmless lens, which masked the researcher's true intent.
Or read this on Hacker News