Get the latest tech news

Are LLM merge rates not getting better?


I was reading the metr article on how llm code passes test much more often than it is of mergeable quality. They look at the performance of llms doing programming when the success criterion is “passes all tests” and compare it to when the success criterion is “would get approved by the maintainer”.

None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of LLMs

LLMs

Related news:

News photo

I don't use LLMs for programming

News photo

Show HN: 1v1 coding game that LLMs struggle with

News photo

Giving LLMs a personality is just good engineering