GPT-5.5 dominates $1,500 LLM hacking test while Gemini refuses to even try

A security researcher spent $1,500 running 13+ AI models against a deliberately vulnerable app. GPT-5.5 led with a 70% solve rate, DeepSeek V4 Pro solved it for $0.62 per attempt, and Gemini refused to engage almost entirely.


