Anthropic’s Claude 3 causes stir by seeming to realize when it was being tested | The model seemingly demonstrated a type of "metacognition" or self-awareness during an evaluation


Claude: "This pizza topping 'fact' may have been inserted as a joke or to test if I was paying attention."

Anthropic prompt engineer Alex Albert shared a story from internal testing of Claude 3 Opus in which the model seemingly demonstrated a type of "metacognition" or self-awareness during a "needle-in-the-haystack" evaluation, prompting both curiosity and skepticism online. Albert found this level of what he called "meta-awareness" impressive and said it highlights the need for the industry to develop deeper evaluations that can more accurately assess the true capabilities and limitations of language models. Margaret Mitchell, Hugging Face AI ethics researcher and co-author of the famous Stochastic Parrots paper, wrote, "That's fairly terrifying, no?"
