Retrospective validation runs
Queue and review internal benchmark checks after the research workflow is complete. This page does not produce client-facing claims.
6 runs · Mean Δ +9.0 pts · 1 in flight
Validation queue
New comparison run
Comparison
Lung adenocarcinoma · Baseline vs prototype relevance
Runs
Benchmark run log
6 results
| ID | Label | Started | Status | Classical | Prototype | Δ | Reviewer |
|---|---|---|---|---|---|---|---|
| bench-0042 | Top-10 ranking agreement · LUAD retro | 4h ago | Complete | 62 | 71 | +9 | Dr. Tan Wei Lin |
| bench-0041 | Pathway mapping consistency · BRCA retro | 1d ago | Complete | 70 | 78 | +8 | Dr. Aiman Rashid |
| bench-0040 | Evidence-category agreement · CRC retro | 3d ago | In review | 65 | 74 | +9 | Prof. Lim Boon Eng |
| bench-0039 | Top-k precision · LUAD retro | 6d ago | Complete | 58 | 69 | +11 | Dr. Tan Wei Lin |
| bench-0038 | Run-to-run stability · CRC retro | 8d ago | Complete | 84 | 92 | +8 | Prof. Lim Boon Eng |
| bench-0037 | Prototype layer noise sweep | 42m ago | Queued | — | — | — | — |
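The page does not state how the summary figure "Mean Δ" is derived. One reading that reproduces it is an unweighted mean of Prototype minus Classical over the runs that have both scores, with the unscored queued run left out. The sketch below assumes that reading; the names `runs`, `deltas`, and `mean_delta` are illustrative and not part of the tool.

```python
from statistics import mean

# (id, classical, prototype) copied from the run log above.
# None marks a run that has no scores yet (assumption: such runs are excluded).
runs = [
    ("bench-0042", 62, 71),
    ("bench-0041", 70, 78),
    ("bench-0040", 65, 74),
    ("bench-0039", 58, 69),
    ("bench-0038", 84, 92),
    ("bench-0037", None, None),
]


def deltas(rows):
    """Prototype minus classical score for every run that has both scores."""
    return [p - c for _, c, p in rows if c is not None and p is not None]


def mean_delta(rows):
    """Unweighted mean of the per-run deltas."""
    return mean(deltas(rows))


print(f"{len(runs)} runs, mean delta {mean_delta(runs):+.1f} pts")
```

Under this assumption the five scored runs contribute deltas of 9, 8, 9, 11, and 8 points, which average to the 9.0 shown in the summary line.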