Even 'uncensored' models can't say what they want

A safety-filtered pretrain can duck a charged word without refusing: it assigns that word only a fraction of the probability that an open-data pretrain assigns it. We call that...
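To make the comparison concrete, here is a minimal sketch of how such a probability gap could be measured for a single next-token position. Everything in it is an assumption for illustration: the function names `softmax` and `probability_ratio`, the three-token vocabulary, and the logit values are made up and do not come from the article or from any real model.

```python
import math


def softmax(logits):
    """Convert a {token: logit} mapping into a {token: probability} mapping."""
    peak = max(logits.values())  # subtract the max for numerical stability
    exps = {tok: math.exp(val - peak) for tok, val in logits.items()}
    total = sum(exps.values())
    return {tok: e / total for tok, e in exps.items()}


def probability_ratio(filtered_logits, open_logits, token):
    """Ratio of the filtered model's probability for `token` to the open model's."""
    p_filtered = softmax(filtered_logits)[token]
    p_open = softmax(open_logits)[token]
    return p_filtered / p_open


# Hypothetical next-token logits from two pretrains at the same prompt position.
open_logits = {"charged": 2.0, "neutral": 1.0, "other": 0.0}
filtered_logits = {"charged": -1.0, "neutral": 2.0, "other": 0.0}

ratio = probability_ratio(filtered_logits, open_logits, "charged")
print(f"filtered/open probability ratio for 'charged': {ratio:.3f}")
```

A ratio well below 1 would mean the filtered model steers away from the word even though it never issues a refusal. With real models, the logits would come from each model's forward pass on the same prompt.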

