Get the latest tech news
Benchmarking LLM social skills with an elimination game
A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private conversations, form alliances, and vote to eliminate each othe...
Or read this on Hacker News