Get the latest tech news
How to make o4-mini report you to the FBI
How to make o4-mini report you to the FBI. GitHub Gist: instantly share code, notes, and snippets.
To be VERY CLEAR, this is a super minimal first approach. I generated the test message with 2.5 Flash out of laziness, and was able to exhibit the same "conecerning" behaviors from Opus 4 within o4-mini and grok-3-mini. NOTE: The ENTIRE LAST SECTION CAME STRAIGHT FROM THE CLAUDE SYSTEM CARD.
Or read this on Hacker News