Benchmarking LLM social skills with an elimination game


A multi-player tournament benchmark that tests LLMs on social reasoning, strategy, and deception. Players hold public and private conversations, form alliances, and vote to eliminate each othe...
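To make the voting mechanic concrete, here is a minimal sketch of an elimination loop. It is not the benchmark's actual code: the function names, the plurality rule, and the alphabetical tie-break are all assumptions made for illustration, and the conversation and alliance phases are reduced to a single hypothetical `choose_vote` callback.

```python
from collections import Counter


def eliminate_one(votes):
    """Return the player who received the most votes.

    `votes` maps each voter to the player they want removed.
    Assumption: ties are broken alphabetically so the result is
    deterministic; the real benchmark may resolve ties differently.
    """
    tally = Counter(votes.values())
    top = max(tally.values())
    return min(name for name, count in tally.items() if count == top)


def run_game(players, choose_vote):
    """Run voting rounds until one player remains.

    `choose_vote(player, remaining)` is a hypothetical stand-in for an
    LLM deciding whom to vote out after the conversation phase. It must
    return a name that is still in `remaining`.
    Returns (winner, elimination_order).
    """
    remaining = list(players)
    eliminated = []
    while len(remaining) > 1:
        votes = {p: choose_vote(p, remaining) for p in remaining}
        out = eliminate_one(votes)
        remaining.remove(out)
        eliminated.append(out)
    return remaining[0], eliminated


def first_other(player, remaining):
    """Toy policy: vote for the first remaining player who is not yourself."""
    return next(p for p in remaining if p != player)


winner, order = run_game(["A", "B", "C", "D"], first_other)
```

In a real harness, `choose_vote` would wrap a model call that sees the public and private messages from the round. The toy policy here only exercises the loop.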
