Skip to content
  • This project
    • Loading...
  • Sign in

wanghai-tech / hikoon-ACR

Go to a project
Toggle navigation
Toggle navigation pinning
  • Projects
  • Groups
  • Snippets
  • Help
  • Project
  • Activity
  • Repository
  • Pipelines
  • Graphs
  • Issues 0
  • Merge Requests 0
  • Wiki
  • Network
  • Create a new issue
  • Builds
  • Commits
  • Issue Boards
  • Files
  • Commits
  • Network
  • Compare
  • Branches
  • Tags
Switch branch/tag
  • hikoon-ACR
  • acr-engine
  • data
  • index_v5
  • reference_embs.npy
  • cnb.bofCdSsphPA's avatar
    Extend dataset bootstrap coverage and improve humming hard-case weighting · 48c97a90 ...
    48c97a90
    Broaden external dataset bootstrap support and replace naive hard-case oversampling with a more targeted weighting signal that measurably helps humming-like queries while preserving the release/eval workflow.
    
    Constraint: Hard-case optimization must be evidence-driven and preserve a record of mixed outcomes across iterations
    Rejected: Reuse naive oversampling after regression | it already showed worse overall behavior with no hard-case gain
    Confidence: medium
    Scope-risk: moderate
    Directive: Next iteration should target confused-case negatives explicitly; do not assume humming gains transfer to confusion robustness
    Tested: bootstrap generation for MTG-Jamendo and ModelScope placeholders; 2-epoch CPU training for models_v5; index_v5 build; fast eval JSON generation for smoke-v5
    Not-tested: real audio ingestion for the new datasets; full melody-aware slow evaluation on models_v5
    cnb.bofCdSsphPA authored 2026-06-02 12:15:19 +0800
reference_embs.npy 90.1 KB
Raw History Permalink

Download (90.1 KB)