| Name | Last Commit | Last Update |
|---|---|---|
| .. | ||
| artifact-manifest.json | ||
| benchmark-report.md | ||
| config.json | ||
| eval-fusion-tuned.json | ||
| eval.json | ||
| model-card.md | ||
| release-checklist.md | ||

Last Commit: d4961b14

pin down the hard-case gap after the first real-path closure

Constraint: Handoff must distinguish clean real-path evidence from hard-case evidence without staging temporary evaluation artifacts
Rejected: Keep scaling clean-only FMA smoke first | Fresh evidence shows the next highest-yield work is hard-case top1 improvement
Confidence: high
Scope-risk: narrow
Directive: Treat humming_like and confused as the primary optimization targets before investing more cycles in larger clean-only smoke runs
Tested: Audited manifest type coverage, verified synthetic_v2 hard-case evaluate results, and updated handoff/changelog docs with the gap analysis
Not-tested: Post-optimization hard-case improvements on real open-data derived hard cases