Get the latest tech news

Reasoning language models have lower accuracy on medical multiple choice questions when "None of the other answers" replaces the correct response.


None

Get the Android app

Or read this on r/technology

Read more on:

Photo of answers

answers

Photo of correct response

correct response

Photo of lower accuracy

lower accuracy

Related news:

News photo

X plans to show ads in Grok chatbot's answers

News photo

Apple Hiring for 'Answers' Team Working on 'ChatGPT-Like Search'

News photo

Apple’s New ‘Answers’ Team Eyes ChatGPT-Like Product in AI Push