Preserve the larger cap24 top-two benchmark checkpoint
Record the new 24-track capped benchmark setup and the first completed hybrid result so the next session can continue the stronger tie-break experiment without rediscovering runtime state.

Constraint: The cap24 benchmark is still in progress, so only partial evidence can be documented now
Rejected: Wait for high_energy to finish before updating handoff | Risks losing the fresh larger-subset evidence if the session ends first
Confidence: high
Scope-risk: narrow
Directive: Replace the partial cap24 section with the final two-strategy ranking once report.json lands
Tested: Verified /tmp/ab_smoke_seg_cap24_top2/hybrid/fma_reports_smoke/eval.json; verified active cap24 processes; verified docs include the exact work-root and resume command
Not-tested: Final cap24 top-two comparison because high_energy is still training
Showing 2 changed files with 60 additions and 0 deletions
| ... | @@ -2,6 +2,27 @@ | ... | @@ -2,6 +2,27 @@ |
| 2 | 2 | ||
| 3 | ## 2026-06-02 | 3 | ## 2026-06-02 |
| 4 | 4 | ||
| 5 | ### Stage: Launch the larger cap24 top2 real-FMA comparison and record the first result | ||
| 6 | |||
| 7 | Completed: | ||
| 8 | - Launched: | ||
| 9 | - `/tmp/ab_smoke_seg_cap24_top2` | ||
| 10 | - Strategies restricted to `hybrid` and `high_energy` | ||
| 11 | - `subset_size = 24` | ||
| 12 | - `max_test_queries = 16` | ||
| 13 | - Updated [session-handoff.md](./session-handoff.md) | ||
| 14 | |||
| 15 | Current fresh evidence: | ||
| 16 | - `hybrid` has finished: | ||
| 17 | - `num_queries = 16` | ||
| 18 | - `top1 = 1.0` | ||
| 19 | - `topk = 1.0` | ||
| 20 | - `high_energy` has entered the training phase; the full comparison run is not yet complete | ||
| 21 | |||
| 22 | Conclusions: | ||
| 23 | - On a real FMA subset larger than cap16, `hybrid` still holds a perfect score so far | ||
| 24 | - The next step is simply to wait for `high_energy` to finish, which will show whether the two strategies remain tied or separate on the larger subset | ||
| 25 | |||
| 5 | ### Stage: Wrap up the cap16 real-FMA capped segmentation benchmark | 26 | ### Stage: Wrap up the cap16 real-FMA capped segmentation benchmark |
| 6 | 27 | ||
| 7 | Completed: | 28 | Completed: | ... | ... |
| ... | @@ -373,6 +373,45 @@ cd /workspace/acr-engine | ... | @@ -373,6 +373,45 @@ cd /workspace/acr-engine |
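| 373 | - Default preference: `hybrid` | 373 | - Default preference: `hybrid` |
| 374 | - Strong second choice: `high_energy` | 374 | - Strong second choice: `high_energy` |
| 375 | - `beat_aware` / `repeated_section_aware` are better suited as supplementary comparisons than as default strategies | 375 | - `beat_aware` / `repeated_section_aware` are better suited as supplementary comparisons than as default strategies |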
| 376 | |||
| 377 | --- | ||
| 378 | |||
| 379 | ## 10. cap24 top2 comparison experiment (in progress) | ||
| 380 | |||
| 381 | To further determine whether `hybrid` and `high_energy` are truly tied, a larger real-FMA comparison has been launched: | ||
| 382 | |||
| 383 | ```bash | ||
| 384 | cd /workspace/acr-engine | ||
| 385 | /usr/local/miniconda3/bin/python scripts/ab_smoke_segmentation.py \ | ||
| 386 | --dataset fma \ | ||
| 387 | --input-dir data/raw/fma_small_audio \ | ||
| 388 | --work-root /tmp/ab_smoke_seg_cap24_top2 \ | ||
| 389 | --subset-size 24 \ | ||
| 390 | --query-duration 8 \ | ||
| 391 | --train-epochs 1 \ | ||
| 392 | --batch-size 2 \ | ||
| 393 | --device cpu \ | ||
| 394 | --strategies hybrid high_energy \ | ||
| 395 | --max-test-queries 16 \ | ||
| 396 | --output-json /tmp/ab_smoke_seg_cap24_top2/report.json | ||
| 397 | ``` | ||
| 398 | |||
| 399 | Current fresh evidence: | ||
| 400 | |||
| 401 | | Strategy | subset | max_test_queries | top1 | topk | Status | | ||
| 402 | |---|---:|---:|---:|---:|---| | ||
| 403 | | `hybrid` | 24 | 16 | 1.0 | 1.0 | Done | | ||
| 404 | | `high_energy` | 24 | 16 | - | - | Training | | ||
| 405 | |||
| 406 | Resume-check command: | ||
| 407 | |||
| 408 | ```bash | ||
| 409 | pgrep -af 'ab_smoke_seg_cap24_top2|external_adapters.py smoke-local fma /tmp/ab_smoke_seg_cap24_top2|evaluate.py --data /tmp/ab_smoke_seg_cap24_top2|run_demo.py build-index --data /tmp/ab_smoke_seg_cap24_top2' | ||
| 410 | ``` | ||
| 411 | |||
| 412 | If `report.json` has not been generated yet, wait first for: | ||
| 413 | - `/tmp/ab_smoke_seg_cap24_top2/high_energy/fma_reports_smoke/eval.json` | ||
| 414 | - `/tmp/ab_smoke_seg_cap24_top2/report.json` | ||
| 376 | - `b766c74` Make open-dataset manifests trainable end to end | 415 | - `b766c74` Make open-dataset manifests trainable end to end |
| 377 | - `fa23144` Add a single-page open dataset workflow for training prep | 416 | - `fa23144` Add a single-page open dataset workflow for training prep |
| 378 | - `af33be3` Condense docs and add manifest validation before training | 417 | - `af33be3` Condense docs and add manifest validation before training | ... | ... |
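
The resume check in the handoff section could also be scripted. The following is a minimal sketch and not part of this commit. It assumes each strategy writes `<work-root>/<strategy>/fma_reports_smoke/eval.json`, as the two `eval.json` paths in this commit suggest, and that the file is a JSON object with `num_queries`, `top1`, and `topk` keys. That schema is an assumption based on the metric names reported in the diff; the actual file layout is not shown here. The helper names `read_eval`, `summarize`, and `verdict` are hypothetical.

```python
import json
from pathlib import Path

# Strategy names taken from the --strategies argument in the handoff doc.
STRATEGIES = ("hybrid", "high_energy")


def read_eval(work_root, strategy):
    """Return the parsed eval.json for one strategy, or None if it is not ready.

    ASSUMPTION: the per-strategy layout <work_root>/<strategy>/fma_reports_smoke/eval.json.
    """
    path = Path(work_root) / strategy / "fma_reports_smoke" / "eval.json"
    if not path.is_file():
        return None
    try:
        with path.open(encoding="utf-8") as fh:
            return json.load(fh)
    except json.JSONDecodeError:
        # A file that is still being written is treated as not ready.
        return None


def summarize(work_root, strategies=STRATEGIES):
    """Build one status row per strategy from whatever eval.json files exist."""
    rows = {}
    for name in strategies:
        data = read_eval(work_root, name)
        if data is None:
            rows[name] = {"status": "pending"}
        else:
            # ASSUMPTION: eval.json exposes these three keys at the top level.
            rows[name] = {
                "status": "done",
                "num_queries": data.get("num_queries"),
                "top1": data.get("top1"),
                "topk": data.get("topk"),
            }
    return rows


def verdict(rows):
    """Report 'incomplete' until every strategy is done, then 'tie' or 'separated'."""
    if any(row["status"] != "done" for row in rows.values()):
        return "incomplete"
    scores = {(row["top1"], row["topk"]) for row in rows.values()}
    return "tie" if len(scores) == 1 else "separated"


if __name__ == "__main__":
    rows = summarize("/tmp/ab_smoke_seg_cap24_top2")
    for name, row in rows.items():
        print(name, row)
    print("verdict:", verdict(rows))
```

While `high_energy` is still training, `verdict` returns `"incomplete"`. It returns `"tie"` only when both strategies report identical `top1` and `topk`, which is the tie-or-separate question the cap24 run is meant to answer.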