Commit 124d4612 124d4612a85301cf0fbb4000553e0eba8ec667b3 by cnb.bofCdSsphPA

Checkpoint the second cap48 seed while the rerun is still building

Preserve the second-seed cap48 entry point and current build-index phase so later sessions can validate whether the cap48 reversal was stable or a seed artifact.

Constraint: The second-seed run has not produced scores yet, so only execution-state evidence is available
Rejected: Wait for the seed123 scores before recording anything | Risks losing the multi-seed validation checkpoint if the session ends first
Confidence: high
Scope-risk: narrow
Directive: Replace the seed123 running-state section with measured scores once hybrid eval.json or report.json land
Tested: Verified active cap48 seed123 processes; verified handoff records work-root, seed, subset size, query cap, and current build-index phase
Not-tested: cap48 seed123 strategy scores because the run is still in progress
1 parent d82d217a
...@@ -2,6 +2,28 @@ ...@@ -2,6 +2,28 @@
2 2
3 ## 2026-06-02 3 ## 2026-06-02
4 4
5 ### Stage: 启动 cap48 第二个 seed 复核反转结果
6
7 完成项:
8 - 启动第二个 seed 的 cap48 top2 benchmark:
9 - `work_root = /tmp/ab_smoke_seg_cap48_top2_seed123`
10 - `subset_size = 48`
11 - `max_test_queries = 24`
12 - `seed = 123`
13 - 策略:`hybrid` vs `high_energy`
14 - 更新 [session-handoff.md](./session-handoff.md)
15
16 当前 fresh evidence:
17 - 第二个 seed 已启动
18 - 当前 first lane 为:
19 - `hybrid`
20 - 当前已进入:
21 - `run_demo.py build-index --resume --checkpoint-every-refs 100`
22
23 结论:
24 - 已经从“单轮 cap48 反转”升级为“开始做多 seed 复核”
25 - 即使当前 session 结束,新 session 也可直接从 handoff 中的 `cap48_top2_seed123` 入口继续
26
5 ### Stage: 收尾 cap48 top2 真实 FMA 对照并发现 high_energy 反超 27 ### Stage: 收尾 cap48 top2 真实 FMA 对照并发现 high_energy 反超
6 28
7 完成项: 29 完成项:
......
...@@ -503,6 +503,46 @@ cap48 top2 最终结论: ...@@ -503,6 +503,46 @@ cap48 top2 最终结论:
503 1. 更大 subset(如 64+) 503 1. 更大 subset(如 64+)
504 2. 多 seed 复跑 504 2. 多 seed 复跑
505 3. style-aware bucket benchmark 505 3. style-aware bucket benchmark
506
507 ---
508
509 ## 13. cap48 top2 第二个 seed(进行中)
510
511 为验证 cap48 的“high_energy 反超”是否稳定,已启动第二个 seed:
512
513 ```bash
514 cd /workspace/acr-engine
515 /usr/local/miniconda3/bin/python scripts/ab_smoke_segmentation.py \
516 --dataset fma \
517 --input-dir data/raw/fma_small_audio \
518 --work-root /tmp/ab_smoke_seg_cap48_top2_seed123 \
519 --subset-size 48 \
520 --query-duration 8 \
521 --train-epochs 1 \
522 --batch-size 2 \
523 --device cpu \
524 --strategies hybrid high_energy \
525 --max-test-queries 24 \
526 --seed 123 \
527 --output-json /tmp/ab_smoke_seg_cap48_top2_seed123/report.json
528 ```
529
530 当前 fresh evidence:
531
532 | 项目 | 状态 |
533 |---|---|
534 | `subset_size` | `48` |
535 | `max_test_queries` | `24` |
536 | `seed` | `123` |
537 | 首个运行策略 | `hybrid` |
538 | 当前阶段 | `run_demo.py build-index --resume --checkpoint-every-refs 100` |
539 | `report.json` | 尚未生成 |
540
541 恢复检查命令:
542
543 ```bash
544 pgrep -af 'ab_smoke_seg_cap48_top2_seed123|external_adapters.py smoke-local fma /tmp/ab_smoke_seg_cap48_top2_seed123|evaluate.py --data /tmp/ab_smoke_seg_cap48_top2_seed123|run_demo.py build-index --data /tmp/ab_smoke_seg_cap48_top2_seed123|train.py --data /tmp/ab_smoke_seg_cap48_top2_seed123'
545 ```
506 - `b766c74` Make open-dataset manifests trainable end to end 546 - `b766c74` Make open-dataset manifests trainable end to end
507 - `fa23144` Add a single-page open dataset workflow for training prep 547 - `fa23144` Add a single-page open dataset workflow for training prep
508 - `af33be3` Condense docs and add manifest validation before training 548 - `af33be3` Condense docs and add manifest validation before training
......