Checkpoint the second cap48 seed while the rerun is still building
Preserve the second-seed cap48 entry point and current build-index phase so later sessions can validate whether the cap48 reversal was stable or a seed artifact.

Constraint: The second-seed run has not produced scores yet, so only execution-state evidence is available
Rejected: Wait for the seed123 scores before recording anything | Risks losing the multi-seed validation checkpoint if the session ends first
Confidence: high
Scope-risk: narrow
Directive: Replace the seed123 running-state section with measured scores once hybrid eval.json or report.json land
Tested: Verified active cap48 seed123 processes; verified handoff records work-root, seed, subset size, query cap, and current build-index phase
Not-tested: cap48 seed123 strategy scores because the run is still in progress
Showing 2 changed files with 62 additions and 0 deletions
| ... | @@ -2,6 +2,28 @@ | ... | @@ -2,6 +2,28 @@ |
| 2 | 2 | ||
| 3 | ## 2026-06-02 | 3 | ## 2026-06-02 |
| 4 | 4 | ||
| 5 | ### Stage: Launch a second cap48 seed to re-check the reversal | ||
| 6 | |||
| 7 | Completed: | ||
| 8 | - Launched the cap48 top2 benchmark with a second seed: | ||
| 9 | - `work_root = /tmp/ab_smoke_seg_cap48_top2_seed123` | ||
| 10 | - `subset_size = 48` | ||
| 11 | - `max_test_queries = 24` | ||
| 12 | - `seed = 123` | ||
| 13 | - Strategies: `hybrid` vs `high_energy` | ||
| 14 | - Updated [session-handoff.md](./session-handoff.md) | ||
| 15 | |||
| 16 | Current fresh evidence: | ||
| 17 | - The second seed has started | ||
| 18 | - The current first lane is: | ||
| 19 | - `hybrid` | ||
| 20 | - Currently in: | ||
| 21 | - `run_demo.py build-index --resume --checkpoint-every-refs 100` | ||
| 22 | |||
| 23 | Conclusion: | ||
| 24 | - Moved from "a single-round cap48 reversal" to "multi-seed re-validation under way" | ||
| 25 | - Even if the current session ends, a new session can resume directly from the `cap48_top2_seed123` entry point in the handoff | ||
| 26 | |||
| 5 | ### Stage: Wrap up the cap48 top2 real-FMA comparison and find that high_energy pulls ahead | 27 | ### Stage: Wrap up the cap48 top2 real-FMA comparison and find that high_energy pulls ahead |
| 6 | 28 | ||
| 7 | Completed: | 29 | Completed: | ... | ... |
| ... | @@ -503,6 +503,46 @@ cap48 top2 final conclusion: | ... | @@ -503,6 +503,46 @@ cap48 top2 final conclusion: |
| 503 | 1. Larger subset (e.g. 64+) | 503 | 1. Larger subset (e.g. 64+) |
| 504 | 2. Multi-seed reruns | 504 | 2. Multi-seed reruns |
| 505 | 3. style-aware bucket benchmark | 505 | 3. style-aware bucket benchmark |
| 506 | |||
| 507 | --- | ||
| 508 | |||
| 509 | ## 13. cap48 top2 second seed (in progress) | ||
| 510 | |||
| 511 | To verify whether the cap48 "high_energy pulls ahead" result is stable, a second seed has been launched: | ||
| 512 | |||
| 513 | ```bash | ||
| 514 | cd /workspace/acr-engine | ||
| 515 | /usr/local/miniconda3/bin/python scripts/ab_smoke_segmentation.py \ | ||
| 516 | --dataset fma \ | ||
| 517 | --input-dir data/raw/fma_small_audio \ | ||
| 518 | --work-root /tmp/ab_smoke_seg_cap48_top2_seed123 \ | ||
| 519 | --subset-size 48 \ | ||
| 520 | --query-duration 8 \ | ||
| 521 | --train-epochs 1 \ | ||
| 522 | --batch-size 2 \ | ||
| 523 | --device cpu \ | ||
| 524 | --strategies hybrid high_energy \ | ||
| 525 | --max-test-queries 24 \ | ||
| 526 | --seed 123 \ | ||
| 527 | --output-json /tmp/ab_smoke_seg_cap48_top2_seed123/report.json | ||
| 528 | ``` | ||
| 529 | |||
| 530 | Current fresh evidence: | ||
| 531 | |||
| 532 | | Item | Status | | ||
| 533 | |---|---| | ||
| 534 | | `subset_size` | `48` | | ||
| 535 | | `max_test_queries` | `24` | | ||
| 536 | | `seed` | `123` | | ||
| 537 | | First strategy to run | `hybrid` | | ||
| 538 | | Current phase | `run_demo.py build-index --resume --checkpoint-every-refs 100` | | ||
| 539 | | `report.json` | Not yet generated | | ||
| 540 | |||
| 541 | Recovery check command: | ||
| 542 | |||
| 543 | ```bash | ||
| 544 | pgrep -af 'ab_smoke_seg_cap48_top2_seed123|external_adapters.py smoke-local fma /tmp/ab_smoke_seg_cap48_top2_seed123|evaluate.py --data /tmp/ab_smoke_seg_cap48_top2_seed123|run_demo.py build-index --data /tmp/ab_smoke_seg_cap48_top2_seed123|train.py --data /tmp/ab_smoke_seg_cap48_top2_seed123' | ||
| 545 | ``` | ||
| 506 | - `b766c74` Make open-dataset manifests trainable end to end | 546 | - `b766c74` Make open-dataset manifests trainable end to end |
| 507 | - `fa23144` Add a single-page open dataset workflow for training prep | 547 | - `fa23144` Add a single-page open dataset workflow for training prep |
| 508 | - `af33be3` Condense docs and add manifest validation before training | 548 | - `af33be3` Condense docs and add manifest validation before training | ... | ... |
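
The Directive in the commit message asks for the seed123 running-state section to be replaced once `report.json` lands. A minimal Python sketch of that readiness check follows. `report_ready` is a hypothetical helper, not part of the repository, and it assumes only that the report is a JSON file at the `--output-json` path recorded in the handoff; nothing is assumed about the report's fields.

```python
import json
from pathlib import Path

# Work root taken from the handoff section added in this commit.
WORK_ROOT = Path("/tmp/ab_smoke_seg_cap48_top2_seed123")


def report_ready(work_root: Path) -> bool:
    """Return True only if report.json exists and parses as JSON.

    Hypothetical helper: it does not inspect the report's schema,
    which this commit does not describe.
    """
    report = work_root / "report.json"
    if not report.is_file():
        return False
    try:
        json.loads(report.read_text(encoding="utf-8"))
    except (json.JSONDecodeError, UnicodeDecodeError):
        # A partially written file means the run is still finishing.
        return False
    return True


if __name__ == "__main__":
    state = "report available" if report_ready(WORK_ROOT) else "no complete report yet"
    print(f"{WORK_ROOT.name}: {state}")
```

A later session could run this before editing the handoff: a `False` result suggests the section should stay marked as in progress, and the `pgrep` check from the handoff can then confirm whether the run is still alive.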