Commit ef7e4493 ef7e4493a4a4525423add0b66f4e6cb25f61d3c7 by cnb.bofCdSsphPA

Refresh the second cap48 seed checkpoint now that hybrid reached evaluation

Update the handoff and changelog with the newer seed123 runtime milestone so later sessions know the hybrid lane has advanced from build-index into capped evaluation.

Constraint: No measured seed123 score is available yet, only a later execution milestone
Rejected: Leave the older build-index note in place | Would make the restart handoff stale and less actionable
Confidence: high
Scope-risk: narrow
Directive: Replace the seed123 runtime note with measured scores as soon as hybrid eval.json or report.json land
Tested: Verified active seed123 hybrid evaluate.py process; verified docs now record seed123 current phase as evaluate.py --max-queries 24
Not-tested: Seed123 strategy scores because hybrid eval.json has not landed yet
1 parent 124d4612
...@@ -18,7 +18,7 @@ ...@@ -18,7 +18,7 @@
18 - 当前 first lane 为: 18 - 当前 first lane 为:
19 - `hybrid` 19 - `hybrid`
20 - 当前已进入: 20 - 当前已进入:
21 - `run_demo.py build-index --resume --checkpoint-every-refs 100` 21 - `evaluate.py --data /tmp/ab_smoke_seg_cap48_top2_seed123/hybrid/fma/manifests ... --max-queries 24`
22 22
23 结论: 23 结论:
24 - 已经从“单轮 cap48 反转”升级为“开始做多 seed 复核” 24 - 已经从“单轮 cap48 反转”升级为“开始做多 seed 复核”
......
...@@ -535,7 +535,7 @@ cd /workspace/acr-engine ...@@ -535,7 +535,7 @@ cd /workspace/acr-engine
535 | `max_test_queries` | `24` | 535 | `max_test_queries` | `24` |
536 | `seed` | `123` | 536 | `seed` | `123` |
537 | 首个运行策略 | `hybrid` | 537 | 首个运行策略 | `hybrid` |
538 | 当前阶段 | `run_demo.py build-index --resume --checkpoint-every-refs 100` | 538 | 当前阶段 | `evaluate.py --max-queries 24` |
539 | `report.json` | 尚未生成 | 539 | `report.json` | 尚未生成 |
540 540
541 恢复检查命令: 541 恢复检查命令:
......