Add batch inventory for multiple open music directories
Constraint: Personal-use dataset preparation needs fast comparison across several local open-music corpora before ingestion
Rejected: Inspect each dataset directory manually one by one | Slows repeated train/eval setup and comparison
Confidence: high
Scope-risk: narrow
Directive: Use inspect-batch on real FMA and MTG-Jamendo folders before selecting training and held-out evaluation corpora
Tested: /usr/local/miniconda3/bin/python -m py_compile src/data/external_adapters.py src/data/manifest_tools.py; /usr/local/miniconda3/bin/python src/data/external_adapters.py inspect-batch fma=tmp/open_music_demo_fma mtg_jamendo=tmp/open_music_demo_jamendo --eval-ratio 0.5 --query-duration 5.0
Not-tested: Real upstream corpus inventory on downloaded full-size open datasets
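
For illustration only, a minimal sketch of what a batch inventory over `name=path` arguments could look like. It is not the code in src/data/external_adapters.py: the function names, the extension list, and the report fields are all hypothetical, and the `--query-duration` option is not modeled here.

```python
from pathlib import Path

# Assumed set of audio extensions; the real adapter may recognize others.
AUDIO_EXTS = {".mp3", ".wav", ".flac", ".ogg"}


def parse_specs(specs):
    """Parse 'name=path' arguments into a {name: Path} mapping."""
    parsed = {}
    for spec in specs:
        name, sep, path = spec.partition("=")
        if not sep or not name or not path:
            raise ValueError(f"expected name=path, got {spec!r}")
        parsed[name] = Path(path)
    return parsed


def inventory(root, eval_ratio):
    """Count audio files under root and size a train/eval split."""
    if not root.is_dir():
        # Report a missing directory instead of failing the whole batch.
        return {"exists": False, "total": 0, "train": 0, "eval": 0}
    files = [
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() in AUDIO_EXTS
    ]
    n_eval = int(len(files) * eval_ratio)
    return {
        "exists": True,
        "total": len(files),
        "train": len(files) - n_eval,
        "eval": n_eval,
    }


def inspect_batch(specs, eval_ratio=0.5):
    """Build one inventory record per named dataset directory."""
    return {
        name: inventory(root, eval_ratio)
        for name, root in parse_specs(specs).items()
    }
```

Under these assumptions, `inspect_batch(["fma=tmp/open_music_demo_fma", "mtg_jamendo=tmp/open_music_demo_jamendo"], eval_ratio=0.5)` would return one record per corpus, which makes the side-by-side comparison named in the Constraint line a single call.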
Showing 4 changed files with 49 additions and 0 deletions