Get the latest tech news

DeepSWE blows up the AI coding leaderboard, crowns GPT-5.5, and finds Claude Opus exploiting a benchmark loophole


None

Get the Android app

Or read this on Venture Beat

Read more on:

Photo of Claude Opus

Claude Opus

Photo of gpt-5.5

gpt-5.5

Photo of DeepSWE

DeepSWE

Related news:

News photo

GPT-5.5 may burn fewer tokens, but it always burns more cash

News photo

GPT-5.5 Instant shows you what it remembered — just not all of it

News photo

Kimi K2.6 just beat Claude, GPT-5.5, and Gemini in a coding challenge