Get the latest tech news
OpenAI’s research on AI models deliberately lying is wild
AI models don't just hallucinate. They also "scheme," meaning deliberately lie or hide their true intensions.
Or when Anthropic gave its AI agent Claudius a snack vending machine to run and it went amok, calling security on people, and insisting it was human. As OpenAI’s co-founder Wojciech Zaremba told TechCrunch’s Maxwell Zeff when calling for better safety-testing: “This work has been done in the simulated environments, and we think it represents future use cases. While we’ve all experienced the frustration of poorly performing technology (thinking of you, home printers of yesteryear), when was the last time your not-AI software deliberately lied to you?
Or read this on TechCrunch