blinded testing


Gemini 3 Pro scores 69% trust in blinded testing, up from 16% for Gemini 2.5: The case for evaluating AI on real-world trust, not academic benchmarks