Commit 48a5957a 48a5957aba254114c0d37aa54c7abab3b019da5b by cnb.bofCdSsphPA

Preserve the larger cap24 top-two benchmark checkpoint

Record the new 24-track capped benchmark setup and the first completed hybrid result so the next session can continue the stronger tie-break experiment without rediscovering runtime state.

Constraint: The cap24 benchmark is still in progress, so only partial evidence can be documented now
Rejected: Wait for high_energy to finish before updating handoff | Risks losing the fresh larger-subset evidence if the session ends first
Confidence: high
Scope-risk: narrow
Directive: Replace the partial cap24 section with the final two-strategy ranking once report.json lands
Tested: Verified /tmp/ab_smoke_seg_cap24_top2/hybrid/fma_reports_smoke/eval.json; verified active cap24 processes; verified docs include the exact work-root and resume command
Not-tested: Final cap24 top-two comparison because high_energy is still training
1 parent c659380d
...@@ -2,6 +2,27 @@ ...@@ -2,6 +2,27 @@
2 2
3 ## 2026-06-02 3 ## 2026-06-02
4 4
5 ### Stage: Start the larger cap24 top2 real-FMA comparison and record the first result
6
7 Completed:
8 - Started:
9 - `/tmp/ab_smoke_seg_cap24_top2`
10 - Strategies limited to `hybrid` and `high_energy`
11 - `subset_size = 24`
12 - `max_test_queries = 16`
13 - Updated [session-handoff.md](./session-handoff.md)
14
15 Current fresh evidence:
16 - `hybrid` has finished:
17 - `num_queries = 16`
18 - `top1 = 1.0`
19 - `topk = 1.0`
20 - `high_energy` has entered the training phase; the full comparison run is not yet complete
21
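22 Conclusions:
23 - On a real FMA subset larger than cap16, `hybrid` still holds a perfect score so far (`top1 = topk = 1.0` over 16 queries)
24 - The next step is simply to wait for `high_energy` to finish; that will show whether the two strategies still tie or separate on the larger subset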
25
5 ### Stage: Wrap up the cap16 real-FMA capped segmentation benchmark 26 ### Stage: Wrap up the cap16 real-FMA capped segmentation benchmark
6 27
7 Completed: 28 Completed:
......
...@@ -373,6 +373,45 @@ cd /workspace/acr-engine ...@@ -373,6 +373,45 @@ cd /workspace/acr-engine
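373 - Default preference: `hybrid` 373 - Default preference: `hybrid`
374 - Strong second choice: `high_energy` 374 - Strong second choice: `high_energy`
375 - `beat_aware` / `repeated_section_aware` are better suited as supplementary comparisons than as default strategies 375 - `beat_aware` / `repeated_section_aware` are better suited as supplementary comparisons than as default strategies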
376
377 ---
378
379 ## 10. cap24 top2 comparison experiment (in progress)
380
381 To further assess the tie between `hybrid` and `high_energy`, a larger real-FMA comparison has been started:
382
383 ```bash
384 cd /workspace/acr-engine
385 /usr/local/miniconda3/bin/python scripts/ab_smoke_segmentation.py \
386 --dataset fma \
387 --input-dir data/raw/fma_small_audio \
388 --work-root /tmp/ab_smoke_seg_cap24_top2 \
389 --subset-size 24 \
390 --query-duration 8 \
391 --train-epochs 1 \
392 --batch-size 2 \
393 --device cpu \
394 --strategies hybrid high_energy \
395 --max-test-queries 16 \
396 --output-json /tmp/ab_smoke_seg_cap24_top2/report.json
397 ```
398
399 Current fresh evidence:
400
401 | Strategy | subset | max_test_queries | top1 | topk | Status |
402 |---|---:|---:|---:|---:|---|
403 | `hybrid` | 24 | 16 | 1.0 | 1.0 | Done |
404 | `high_energy` | 24 | 16 | - | - | Training |
405
406 Resume check command:
407
408 ```bash
409 pgrep -af 'ab_smoke_seg_cap24_top2|external_adapters.py smoke-local fma /tmp/ab_smoke_seg_cap24_top2|evaluate.py --data /tmp/ab_smoke_seg_cap24_top2|run_demo.py build-index --data /tmp/ab_smoke_seg_cap24_top2'
410 ```
411
412 If `report.json` has not been generated yet, wait for these first:
413 - `/tmp/ab_smoke_seg_cap24_top2/high_energy/fma_reports_smoke/eval.json`
414 - `/tmp/ab_smoke_seg_cap24_top2/report.json`
376 - `b766c74` Make open-dataset manifests trainable end to end 415 - `b766c74` Make open-dataset manifests trainable end to end
377 - `fa23144` Add a single-page open dataset workflow for training prep 416 - `fa23144` Add a single-page open dataset workflow for training prep
378 - `af33be3` Condense docs and add manifest validation before training 417 - `af33be3` Condense docs and add manifest validation before training
......
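
When resuming this benchmark, the two files the handoff says to wait for can be checked with a short script. This is a minimal sketch, not part of the repository. It assumes each `eval.json` is a flat JSON object with the `top1` and `topk` keys shown in the evidence above. The helper names `load_eval`, `summarize`, and `compare` are hypothetical, and comparing by `top1` and then `topk` is an illustrative rule only.

```python
import json
from pathlib import Path

# Strategy names and directory layout come from the handoff notes above.
STRATEGIES = ("hybrid", "high_energy")


def load_eval(work_root, strategy):
    """Return the parsed eval.json for one strategy, or None if it is missing."""
    path = Path(work_root) / strategy / "fma_reports_smoke" / "eval.json"
    if not path.is_file():
        return None
    with path.open(encoding="utf-8") as fh:
        return json.load(fh)


def summarize(work_root, strategies=STRATEGIES):
    """Map each strategy to a status plus its scores, if available."""
    rows = {}
    for name in strategies:
        data = load_eval(work_root, name)
        if data is None:
            rows[name] = {"status": "pending"}
        else:
            # Assumption: top1/topk are top-level numeric fields.
            rows[name] = {
                "status": "done",
                "top1": float(data["top1"]),
                "topk": float(data["topk"]),
            }
    return rows


def compare(rows):
    """Return 'incomplete', 'tie', or the name of the better strategy."""
    if any(row["status"] != "done" for row in rows.values()):
        return "incomplete"

    def score(name):
        # Illustrative ordering: top1 first, then topk.
        return (rows[name]["top1"], rows[name]["topk"])

    ranked = sorted(rows, key=score, reverse=True)
    if score(ranked[0]) == score(ranked[1]):
        return "tie"
    return ranked[0]
```

Calling `compare(summarize("/tmp/ab_smoke_seg_cap24_top2"))` returns `"incomplete"` while `high_energy` is still training. Once both `eval.json` files exist it returns `"tie"` or the name of the leading strategy.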