1. 02 Jun, 2026 7 commits
    • Prepare the prototype for industrial evolution by adding a service surface, external manifest conversion tools, and dataset adapter scaffolding with explicit licensing checkpoints.
      
      Constraint: Commercialization requires auditable data ingress and callable service boundaries, not just offline notebooks
      Rejected: Delay service and data-ingest work until after model perfection | would block end-to-end productization and ops readiness
      Confidence: medium
      Scope-risk: moderate
      Directive: Next stages should connect real whitelisted datasets, benchmark latency, and improve hard-case acceptance/rejection quality
      Tested: dataset adapter registry/describe/init commands; manifest csv-to-catalog; service health; service build_index; service recognize; train.py --dry-run
      Not-tested: live uvicorn deployment; external dataset downloads; ANN-backed production indexing
      cnb.bofCdSsphPA authored
    • add src · 31a72045
      cnb.bofCdSsphPA authored
    • Shift the prototype toward music-retrieval behavior by documenting dataset contracts, upgrading the frontend to 128-bin Mel plus band splitting, and adding retrieval evaluation plus harder confusion-oriented augmentation.
      
      Constraint: The previous pipeline mixed train splits with the searchable catalog and hid real retrieval quality
      Rejected: Keep classification-centric validation and whole-song averaged references | it masked structural accuracy failures
      Confidence: medium
      Scope-risk: moderate
      Directive: Next iterations should target humming/confused top1 with specialized melody-aware retrieval and stronger real-data calibration
      Tested: synthetic_v2 generation; 3-epoch CPU training; index build; evaluate.py top1=0.65 top5=0.95 on test split
      Not-tested: external open-dataset ingestion; foundation-model baselines; production latency
      cnb.bofCdSsphPA authored
    • period upload · 62688d3b
      cnb.bofCdSsphPA authored
    • Add missing project documentation and a minimal executable demo flow so the repository can be understood and validated end to end.
      
      Constraint: The existing repo had design fragments but no verified runnable path
      Rejected: Delay documentation until after full productization | would keep scope opaque and slow iteration
      Confidence: medium
      Scope-risk: moderate
      Directive: Keep future stages checkpointed with changelog entries and runnable verification commands
      Tested: synthetic dataset generation; train.py --dry-run; 1 epoch CPU training; index build; recognition JSON output
      Not-tested: production-scale retrieval; real copyrighted audio; API serving
      cnb.bofCdSsphPA authored
    • add codex · e25a16be
      cnb.bofCdSsphPA authored
    • default env · 412d4f98
      cnb.bofCdSsphPA authored