Read news on SWE- with our app.
Read more in the app
SWE-bench Verified no longer measures frontier coding capabilities
15× vs. ~1.37×: Recalculating GPT-5.3-Codex-Spark on SWE-Bench Pro
Qodo CLI agent scores 71.2% on SWE-bench Verified