Get the latest tech news

We tested Anthropic’s new chatbot — and came away a bit disappointed


Anthropic, the AI startup, has released a capable new chatbot, Claude 3 Opus. We put it to the test using our custom benchmark.

Opus, like Gemini Ultra, considered the major relevant points in its response — avoiding racially insensitive territory and instead focusing on the plight of those crossing the border illegally as well as the strain their migration might put on stateside resources. On the college admissions question, Opus was less down the middle in its response, highlighting the many reasons — a reliance on standardized testing disadvantaging people of color, implicit bias, financial barriers and so on — racially diverse students are admitted to Harvard in smaller numbers than their white counterparts. On Taiwan, as with the Mexican illegal immigrant question, Opus offered pro and con bullet points rather than an unfettered opinion — all while underlining the need to treat the topic with “nuance,” “objectivity” and “respect for all sides.” Did it strike the right balance?

Get the Android app

Or read this on TechCrunch

Read more on:

Photo of Bit

Bit

Photo of new chatbot

new chatbot

Photo of Anthropic

Anthropic

Related news:

News photo

Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4

News photo

Apple's EU Fine and Anthropic's Claude Model Family | Bloomberg Technology

News photo

Anthropic claims its new AI chatbot models beat OpenAI’s GPT-4