OTelBench

OTelBench: AI struggles with simple SRE tasks (Opus 4.5 scores only 29%)