Get the latest tech news

The AI Agent Era Requires a New Kind of Game Theory


Zico Kolter, a Carnegie Mellon professor and board member at OpenAI, tells WIRED about the dangers of AI agents interacting with one another—and why models need to be more resistant to attacks.

His research group at Carnegie Mellon University has discovered numerous methods of tricking, goading, and confusing advanced AI models into being their worst selves. We are making progress here, developing much better [defensive] techniques, but if you break the underlying model, you basically have the equivalent to a buffer overflow [a common way to hack software]. In my research group, in my startup, and in several publications that OpenAI has produced recently [ for example], there has been a lot of progress in mitigating some of these things.

Get the Android app

Or read this on Wired

Read more on:

Photo of new kind

new kind

Photo of game theory

game theory

Photo of AI agent era

AI agent era

Related news:

News photo

Game Theory and Settling the Debts of the Deceased in Ancient Times

News photo

World's first "Synthetic Biological Intelligence" runs on living human cells | The CL1, offers a whole new kind of computing intelligence

News photo

Microsoft's Majorana 1 quantum computing chip uses a new kind of superconductor