Anthropic’s Claude 3 causes stir by seeming to realize when it was being tested | The model seemingly demonstrated a type of "metacognition" or self-awareness during an evaluation

Claude: "This pizza topping 'fact' may have been inserted as a joke or to test if I was paying attention."

Anthropic prompt engineer Alex Albert shared a story from internal testing of Claude 3 Opus in which the model seemingly demonstrated a type of "metacognition" or self-awareness during a "needle-in-the-haystack" evaluation, prompting both curiosity and skepticism online. Albert found this level of what he called "meta-awareness" impressive and said it highlights the need for the industry to develop deeper evaluations that can more accurately assess the true capabilities and limitations of language models. Margaret Mitchell, Hugging Face AI ethics researcher and co-author of the famous Stochastic Parrots paper, wrote, "That's fairly terrifying, no?"
