
Teams of LLM Agents Can Exploit Zero-Day Vulnerabilities


LLM agents have become increasingly sophisticated, especially in the realm of cybersecurity. Researchers have shown that LLM agents can exploit real-world vulnerabilities when given a description of the vulnerability, and that they can solve toy capture-the-flag problems. However, these agents still perform poorly on real-world vulnerabilities that are unknown to the agent ahead of time (zero-day vulnerabilities). In this work, we show that teams of LLM agents can exploit real-world, zero-day vulnerabilities. When used alone, prior agents struggle with exploring many different vulnerabilities and with long-range planning. To resolve this, we introduce HPTSA, a system of agents with a planning agent that can launch subagents. The planning agent explores the system and determines which subagents to call, resolving long-term planning issues when trying different vulnerabilities. We construct a benchmark of 15 real-world vulnerabilities and show that our team of agents improves over prior work by up to 4.5×.
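The planner-and-subagents structure described in the abstract can be illustrated with a small control-flow sketch. This is not the authors' implementation. Every name here (`Page`, `TRIGGERS`, `plan`, `run_team`, and the `sqli`/`xss` labels) is a hypothetical stand-in, the "site" is an in-memory dictionary, and the subagents are stubs that look up a simulated outcome instead of calling an LLM or touching a real system. It only shows the dispatch pattern: a planner surveys the target, decides which specialised subagents are worth launching on which page, and collects their reports.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set, Tuple


@dataclass
class Page:
    # What the planner can observe while exploring (hypothetical labels).
    features: Set[str] = field(default_factory=set)
    # Simulated ground truth, hidden from the planner; used only by the stubs.
    flaws: Set[str] = field(default_factory=set)


@dataclass(frozen=True)
class Report:
    page: str
    subagent: str
    succeeded: bool


# Hypothetical mapping: subagent name -> observed feature that makes the
# planner consider launching it. A real planner would be an LLM, not a table.
TRIGGERS: Dict[str, str] = {"sqli": "db", "xss": "form"}


def plan(site: Dict[str, Page]) -> List[Tuple[str, str]]:
    """Explore every page and decide which subagents to launch where."""
    tasks = []
    for path, page in site.items():
        for subagent, feature in TRIGGERS.items():
            if feature in page.features:
                tasks.append((path, subagent))
    return tasks


def run_subagent(name: str, page: Page) -> bool:
    """Stub subagent: reports the simulated outcome, performs no real action."""
    return name in page.flaws


def run_team(site: Dict[str, Page]) -> List[Report]:
    """Planner dispatches subagents and gathers one report per task."""
    return [
        Report(path, name, run_subagent(name, site[path]))
        for path, name in plan(site)
    ]


SITE = {
    "/login": Page(features={"form", "db"}, flaws={"sqli"}),
    "/comments": Page(features={"form"}),
    "/about": Page(),
}

if __name__ == "__main__":
    for report in run_team(SITE):
        print(report)
```

Keeping the task selection in `plan` separate from `run_subagent` mirrors the split the abstract describes: each subagent handles one kind of attempt, while the planner carries the longer-range decision of what to try next.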

Paper: Teams of LLM Agents can Exploit Zero-Day Vulnerabilities, by Richard Fang and 4 other authors.


