Read news on noisy LLM evaluators with our app.
Read more in the app
Even (very) noisy LLM evaluators are useful for improving AI agents