Get the latest tech news

Benchmarking research shows leading AI models still struggle to reliably produce structured outputs used in software development


New research from the University of Waterloo shows that artificial intelligence (AI) still struggles with some basic software development tasks, raising questions about how reliably AI systems can assist developers. As Large Language Models (LLMs) are increasingly incorporated into software development, developers have struggled to ensure that AI-generated responses are

None

Get the Android app

Or read this on r/technology

Read more on:

Photo of research

research

Photo of Shows

Shows

Photo of software development

software development

Related news:

News photo

Samsung to Invest Record $73 Billion in AI Chip Comeback Bid

News photo

ChatGPT, Gemini, and other chatbots helped teens plan shootings, bombings, and political violence, study shows / Of the 10 major chatbots tested, only one, Claude, reliably shut down would-be attackers.

News photo

Mark Zuckerberg downplays Meta's own research in New Mexico child safety trial