Get the latest tech news

Even 'uncensored' models can't say what they want


A safety-filtered pretrain can duck a charged word without refusing. It puts a fraction of the probability an open-data pretrain puts there. We call that...

None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Models

Models

Related news:

News photo

I've tested every major phone release in 2026 so far - and my buying advice is changing this year

News photo

Memory card and flash drive pricing surges 120%, with some models spiking 260%

News photo

iPhone 18 Pro colors revealed: Exclusive look at Apple’s 2026 models