Get the latest tech news

CueBench for Developers is live: score how well you drive coding agents


{{ dashKicker }} {{ dashTitle }} Updated {{ updatedAt }} {{ ac.label }} {{ ac.value }} {{ ac.sub }} {{ heroLabel }}{{ heroBody }} {{ v.label }}{{ v.delta }} {{ v.score }} {{ v.spark }} Cost & efficiency {{ costTotal }} Total spend {{ costAvg }} Avg per session Model usage {{ mr.model }} {{ mr.count }} No session data yet — cost metrics will appear once sessions are recorded. Risk signals {{ riskLoopsDisplay }} Sessions with loops of total sessions flagged {{ riskBelowThreshold }} Review developers → Below threshold developers below {{ alertBelow }}-point target {{ c.tag }} {{ c.initials }} {{ c.name }} {{ c.role }} {{ c.value }} {{ c.valueSub }} People & Skills Developers Updated {{ updatedAt }} ! ALERT {{ alertText }} Review {{ alertCount }} developers Side-by-side comparisonClear × {{ ce.name }} {{ ce.role }} · {{ ce.dept }} {{ cv.label }} {{ cv.score }} Overall {{ ce.overall }} {{ tableTitle }} {{ tableMeta }} {{ filteredCount }} developers {{ col.label }} {{ r.chkCell }} {{ r.rank }} {{ r.initials }} {{ r.dept }}{{ r.teamCell }} {{ r.role }}{{ r.overallCell }} {{ r.v1 }} {{ r.v2 }} {{ r.v3 }} {{ r.v4 }} {{ r.trendCell }} Drop in a session — get scored Drag & drop your Claude Code or Codex session logs (.jsonl), or click to browse.

None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of developers

developers

Photo of coding agents

coding agents

Photo of CueBench

CueBench

Related news:

News photo

Show HN: TaskPeace – a task queue my AI coding agents pull work from over MCP

News photo

Top ‘Suicide Squad’ Developers Say the Flop Made Them Not Want to Make Games Anymore

News photo

Safari’s new MCP server lets coding agents inspect and debug websites