DeepSWE blows up the AI coding leaderboard, crowns GPT-5.5, and finds Claude Opus exploiting a benchmark loophole

Related news:

GPT-5.5 may burn fewer tokens, but it always burns more cash

GPT-5.5 Instant shows you what it remembered — just not all of it

Kimi K2.6 just beat Claude, GPT-5.5, and Gemini in a coding challenge