Commit 29c1962c 29c1962c7d54f4c0dab1f0b45f4f6f74958d8e64 by cnb.bofCdSsphPA

Capture fresh high-energy benchmark evidence in the restart handoff

Update the handoff and changelog with the newly finished capped FMA high_energy result so the next session starts from current evidence instead of stale partials.

Constraint: Benchmark is still running overall and only partial strategies are complete
Rejected: Wait for repeated_section_aware to finish before updating handoff | Risks another stale restart gap
Confidence: high
Scope-risk: narrow
Directive: Replace the partial cap16 table with the final ranking once repeated_section_aware and report.json land
Tested: Verified /tmp/ab_smoke_seg_cap16/high_energy/fma_reports_smoke/eval.json; verified docs now record high_energy = 12 / 1.0 / 1.0
Not-tested: Final cap16 multi-strategy report because repeated_section_aware is still in progress
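The check recorded under `Tested:` (reading each strategy's `eval.json` and copying its numbers into the handoff table) could be scripted roughly as follows. This is a minimal sketch, not the repository's tooling: the helper names `load_eval`, `table_row`, and `partial_table` are hypothetical, and it assumes `eval.json` exposes `num_queries`, `top1`, and `topk` as top-level keys, which the commit does not confirm. Only the `<work-root>/<strategy>/fma_reports_smoke/eval.json` layout is taken from the paths above.

```python
import json
from pathlib import Path

# Strategy names as they appear in the handoff table.
STRATEGIES = ["hybrid", "beat_aware", "high_energy", "repeated_section_aware"]


def load_eval(work_root, strategy):
    """Return the parsed eval.json for a strategy, or None if it is not on disk yet."""
    path = Path(work_root) / strategy / "fma_reports_smoke" / "eval.json"
    if not path.is_file():
        return None
    with path.open(encoding="utf-8") as fh:
        return json.load(fh)


def table_row(strategy, result):
    """Format one markdown table row; unfinished strategies get '-' placeholders."""
    if result is None:
        return f"| `{strategy}` | - | - | - | not finished |"
    # ASSUMPTION: these three keys sit at the top level of eval.json.
    return (
        f"| `{strategy}` | {result['num_queries']} | "
        f"{result['top1']} | {result['topk']} | done |"
    )


def partial_table(work_root, strategies=STRATEGIES):
    """Build table rows for every strategy from whatever has landed so far."""
    return [table_row(s, load_eval(work_root, s)) for s in strategies]


if __name__ == "__main__":
    for row in partial_table("/tmp/ab_smoke_seg_cap16"):
        print(row)
```

Treating a missing file as "not finished" rather than as an error lets the same script be rerun while the benchmark is still running, so the table can be refreshed as each strategy completes.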
1 parent 2c909862
......@@ -15,14 +15,16 @@
- Recorded the partial results obtained so far:
- `hybrid`: `num_queries=12`, `top1=1.0`, `topk=1.0`
- `beat_aware`: `num_queries=12`, `top1=0.9167`, `topk=1.0`
- `high_energy`: `num_queries=12`, `top1=1.0`, `topk=1.0`
Verification results:
- Confirmed from the running process:
- `scripts/ab_smoke_segmentation.py ... --work-root /tmp/ab_smoke_seg_cap16`
- `high_energy` strategy is still in progress
- `repeated_section_aware` strategy is still in progress
- Confirmed from evaluation files already written to disk:
- `/tmp/ab_smoke_seg_cap16/hybrid/fma_reports_smoke/eval.json`
- `/tmp/ab_smoke_seg_cap16/beat_aware/fma_reports_smoke/eval.json`
- `/tmp/ab_smoke_seg_cap16/high_energy/fma_reports_smoke/eval.json`
Conclusion:
- Even if the current session is interrupted right now, the handoff material needed to resume the run is already in place
......
......@@ -337,7 +337,7 @@ cd /workspace/acr-engine
|---|---:|---:|---:|---|
| `hybrid` | 12 | 1.0 | 1.0 | done |
| `beat_aware` | 12 | 0.9167 | 1.0 | done |
| `high_energy` | - | - | - | in progress |
| `high_energy` | 12 | 1.0 | 1.0 | done |
| `repeated_section_aware` | - | - | - | not started / not finished |
### First-priority action after restart
......