Last Commit: d4961b14

pin down the hard-case gap after the first real-path closure

Constraint: Handoff must distinguish clean real-path evidence from hard-case evidence without staging temporary evaluation artifacts
Rejected: Keep scaling clean-only FMA smoke first | Fresh evidence shows the next highest-yield work is hard-case top1 improvement
Confidence: high
Scope-risk: narrow
Directive: Treat humming_like and confused as the primary optimization targets before investing more cycles in larger clean-only smoke runs
Tested: Audited manifest type coverage, verified synthetic_v2 hard-case evaluate results, and updated handoff/changelog docs with the gap analysis
Not-tested: Post-optimization hard-case improvements on real open-data derived hard cases

| Name | Last Commit | Last Update |
|---|---|---|
| .. | ||
| artifact_smoke | | |
| external_bootstrap | | |
| external_ingested | | |
| external_smoke | | |
| index_open_smoke_fixed | | |
| index_v3 | | |
| index_v4 | | |
| index_v5 | | |
| index_v6 | | |
| models_open_smoke_fixed | | |
| models_v3 | | |
| models_v4 | | |
| models_v5 | | |
| models_v6 | | |
| raw | | |
| synthetic_v2 | | |