Refresh the second cap48 seed checkpoint now that hybrid reached evaluation
Update the handoff and changelog with the newer seed123 runtime milestone so later sessions know the hybrid lane has advanced from build-index into capped evaluation. Constraint: No measured seed123 score is available yet, only a later execution milestone Rejected: Leave the older build-index note in place | Would make the restart handoff stale and less actionable Confidence: high Scope-risk: narrow Directive: Replace the seed123 runtime note with measured scores as soon as hybrid eval.json or report.json land Tested: Verified active seed123 hybrid evaluate.py process; verified docs now record seed123 current phase as evaluate.py --max-queries 24 Not-tested: Seed123 strategy scores because hybrid eval.json has not landed yet
Showing
2 changed files
with
2 additions
and
2 deletions
| ... | @@ -18,7 +18,7 @@ | ... | @@ -18,7 +18,7 @@ |
| 18 | - 当前 first lane 为: | 18 | - 当前 first lane 为: |
| 19 | - `hybrid` | 19 | - `hybrid` |
| 20 | - 当前已进入: | 20 | - 当前已进入: |
| 21 | - `run_demo.py build-index --resume --checkpoint-every-refs 100` | 21 | - `evaluate.py --data /tmp/ab_smoke_seg_cap48_top2_seed123/hybrid/fma/manifests ... --max-queries 24` |
| 22 | 22 | ||
| 23 | 结论: | 23 | 结论: |
| 24 | - 已经从“单轮 cap48 反转”升级为“开始做多 seed 复核” | 24 | - 已经从“单轮 cap48 反转”升级为“开始做多 seed 复核” | ... | ... |
| ... | @@ -535,7 +535,7 @@ cd /workspace/acr-engine | ... | @@ -535,7 +535,7 @@ cd /workspace/acr-engine |
| 535 | | `max_test_queries` | `24` | | 535 | | `max_test_queries` | `24` | |
| 536 | | `seed` | `123` | | 536 | | `seed` | `123` | |
| 537 | | 首个运行策略 | `hybrid` | | 537 | | 首个运行策略 | `hybrid` | |
| 538 | | 当前阶段 | `run_demo.py build-index --resume --checkpoint-every-refs 100` | | 538 | | 当前阶段 | `evaluate.py --max-queries 24` | |
| 539 | | `report.json` | 尚未生成 | | 539 | | `report.json` | 尚未生成 | |
| 540 | 540 | ||
| 541 | 恢复检查命令: | 541 | 恢复检查命令: | ... | ... |
-
Please register or sign in to post a comment