Terminal-Bench 2.0 launches alongside Harbor, a new framework for testing agents in containers