Get the latest tech news

New benchmark shows top LLMs struggle in real mental health care


Sword Health releases an open-source, expert-validated framework to rigorously assess the clinical competence of AI for mental health support.

None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of LLMs

LLMs

Photo of new benchmark

new benchmark

Photo of struggle

struggle

Related news:

News photo

OpenEvolve: Teaching LLMs to Discover Algorithms Through Evolution

News photo

I misused LLMs to diagnose myself and ended up bedridden for a week

News photo

Launch HN: Mentat (YC F24) – Controlling LLMs with Runtime Intervention