Get the latest tech news

AI Can Write Code But Lacks Engineer's Instinct, OpenAI Study Finds


Leading AI models can fix broken code, but they're nowhere near ready to replace human software engineers, according to extensive testing [PDF] by OpenAI researchers. The company's latest study put AI models and systems through their paces on real-world programming tasks, with even the most advanced...

Leading AI models can fix broken code, but they're nowhere near ready to replace human software engineers, according to extensive testing[PDF] by OpenAI researchers. The company's latest study put AI models and systems through their paces on real-world programming tasks, with even the most advanced models solving only a quarter of typical engineering challenges.The research team created a test called SWE-Lancer, drawing from 1,488 actual software fixes made to Expensify's codebase, representing $1 million worth of freelance engineering work. Instead of relying on simplified programming puzzles, OpenAI's benchmark uses complete software engineering tasks that range from quick $50 bug fixes to complex $32,000 feature implementations.

Get the Android app

Or read this on Slashdot

Read more on:

Photo of Code

Code

Photo of Instinct

Instinct

Photo of engineer

engineer

Related news:

News photo

Progress Continues On Unofficial Firefox GTK4 Port, Code Now Available On GitHub

News photo

134k Lines Of Code Posted As Latest Effort For COBOL Support Within GCC

News photo

We are the "thin blue line" that is trying to keep the code high quality