Make segmentation strategy benchmarks comparable under fixed query budgets
Clarify that the pipeline already mixes random sampling with librosa-guided candidate selection, while keeping heavier structural segmentation as a later optimization path. Constraint: Must avoid staging local datasets and transient smoke artifacts Rejected: Full librosa.segment.* default rollout | Too CPU-heavy and too distribution-shaping for current smoke/training stage Confidence: high Scope-risk: narrow Directive: Keep future segmentation comparisons capped by equal query budgets when reporting quality deltas Tested: py_compile for evaluate/external_adapters/ab_smoke_segmentation; evaluate.py --max-queries 5; ab_smoke_segmentation end-to-end smoke with max_test_queries=5 Not-tested: Multi-strategy medium-size capped A/B benchmark on larger real FMA subset
Showing
7 changed files
with
164 additions
and
9 deletions
-
Please register or sign in to post a comment