Benchmark segmentation strategies on a real FMA mini-smoke set
Constraint: Strategy comparisons need real-audio evidence, but the benchmark must stay cheap enough to run repeatedly on CPU during active development Rejected: Judge winners only by top1/topk on a tiny subset | ties hide the practical value of strategies that generate far more usable queries Confidence: medium Scope-risk: narrow Directive: Keep num_queries as a tie-breaker for tiny-smoke comparisons; increase subset size before promoting benchmark winners to default training policy Tested: /usr/local/miniconda3/bin/python acr-engine/scripts/ab_smoke_segmentation.py --dataset fma --input-dir acr-engine/data/raw/fma_small_audio --work-root /tmp/ab_smoke_seg --subset-size 8 --query-duration 8 --train-epochs 1 --batch-size 2 --device cpu --output-json /tmp/ab_smoke_seg/report.json; post-run ranking verification from /tmp/ab_smoke_seg/report.json Not-tested: Larger FMA subsets or difficult internal query mixes in the same benchmark script
Showing
3 changed files
with
234 additions
and
0 deletions
acr-engine/scripts/ab_smoke_segmentation.py
0 → 100644
-
Please register or sign in to post a comment