Raise ACR robustness with retrieval-first structure and music-aware inputs
Shift the prototype toward music-retrieval behavior by documenting dataset contracts, upgrading the frontend to 128-bin Mel plus band splitting, and adding retrieval evaluation plus harder confusion-oriented augmentation. Constraint: The previous pipeline mixed train splits with the searchable catalog and hid real retrieval quality Rejected: Keep classification-centric validation and whole-song averaged references | it masked structural accuracy failures Confidence: medium Scope-risk: moderate Directive: Next iterations should target humming/confused top1 with specialized melody-aware retrieval and stronger real-data calibration Tested: synthetic_v2 generation; 3-epoch CPU training; index build; evaluate.py top1=0.65 top5=0.95 on test split Not-tested: external open-dataset ingestion; foundation-model baselines; production latency
Showing
9 changed files
with
390 additions
and
35 deletions
docs/dataset-spec.md
0 → 100644
docs/sota-research-2026.md
0 → 100644
-
Please register or sign in to post a comment