15× vs. ~1.37×: Recalculating GPT-5.3-Codex-Spark on SWE-Bench Pro