Promote hybrid to the default strategy using the stronger cap24 evidence
Persist the larger real-FMA benchmark result showing hybrid clearly outperforming high_energy, so the project recommendation can converge on one default instead of an unresolved tie.

Constraint: Only docs change because benchmark outputs remain outside version control
Rejected: Keep treating hybrid and high_energy as co-equal defaults | The larger 24-track capped benchmark now separates them clearly
Confidence: high
Scope-risk: narrow
Directive: Use cap24 top-two as the current strongest public evidence until a larger capped benchmark supersedes it
Tested: Verified /tmp/ab_smoke_seg_cap24_top2/report.json; verified high_energy eval.json; verified docs now state hybrid=16/1.0/1.0 and high_energy=16/0.8125/1.0
Not-tested: Broader strategy comparison beyond hybrid vs high_energy on the 24-track subset
Showing 3 changed files with 38 additions and 4 deletions