
ACR Docs Overview

The docs directory currently keeps only documents directly related to the song-centric, four-table fused schema.


1. What to read first

Reading order for newcomers taking over the project:

  1. start-here.md
  2. delivery-onepager.md
  3. session-handoff.md
  4. song-ingest-query-delivery.md
  5. research-delivery-roadmap.md
  6. selected20_songid_eval.md
  7. postgresql-data-model.md
  8. postgres_db_schema_samples.md
  9. CHANGELOG.md

Treat the first four documents as the default onboarding path:

start-here -> delivery-onepager -> session-handoff -> song-ingest-query-delivery

2. Current default design conventions

Logical semantics:

song -> asset -> window -> fingerprint / embedding

Physical tables:

media_entity -> audio_object -> feature_fact -> set_membership

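Core goals:

  • The final result is always a stable song_id
  • A single song may have multiple audio files
  • A window is the smallest unit of slicing, evidence, and recall
  • feature_fact carries both the exact lane and the semantic lane
  • Phase-1 reuses an open-source encoder as-is, with no up-front training or fine-tuning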
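
As an illustration of how window-level evidence might roll up to a single song_id, here is a minimal sketch. It assumes that each matched window carries a similarity score and that scores are summed per song; the WindowHit type, its score field, and the resolve_song_id helper are hypothetical names, not taken from the acr-engine code.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class WindowHit:
    """One matched window (hypothetical shape, for illustration only)."""
    song_id: str    # the song that owns the asset
    asset_id: str   # one of possibly several audio files under the song
    window_id: str  # smallest unit of slicing / evidence / recall
    score: float    # assumed similarity score, higher is better


def resolve_song_id(hits: Iterable[WindowHit]) -> Optional[str]:
    """Aggregate window-level evidence into a single song_id.

    Scores are summed per song, so windows from different assets of the
    same song reinforce each other. Ties are broken by song_id so the
    result is deterministic. Returns None when there are no hits.
    """
    totals = defaultdict(float)
    for hit in hits:
        totals[hit.song_id] += hit.score
    if not totals:
        return None
    # Highest total first; lexicographically smallest song_id on ties.
    return min(totals, key=lambda sid: (-totals[sid], sid))
```

With this aggregation, two moderate hits from different assets of the same song can outweigh one strong hit from another song, which is one way to honor the "multiple audio files per song" goal. Other aggregation rules (max score, vote count) would fit the same interface.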

3. One-command verification of the main pipeline

cd /workspace
/usr/local/miniconda3/bin/python acr-engine/scripts/run_songcentric_directory_pipeline_live.py \
  --dsn 'postgres://d2:d2pass@127.0.0.1:5432/d2' \
  --schema acr_songcentric_test \
  --input-root acr-engine/data/songcentric_builder_smoke \
  --output-dir acr-engine/data/pgvector_eval/music20

Wrapper script:

acr-engine/scripts/start_songcentric_shortest_path.sh 'postgres://d2:d2pass@127.0.0.1:5432/d2'
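
If the same pipeline needs to be launched from Python (for example from a test harness), the command can be assembled as an argument list. This is only a sketch: the script path and the four flags are copied from the command in this section, while build_pipeline_command, run_pipeline, and the ACR_DSN environment variable are hypothetical names introduced for illustration.

```python
import os
import subprocess
from typing import List

PYTHON = "/usr/local/miniconda3/bin/python"
SCRIPT = "acr-engine/scripts/run_songcentric_directory_pipeline_live.py"


def build_pipeline_command(
    dsn: str,
    schema: str = "acr_songcentric_test",
    input_root: str = "acr-engine/data/songcentric_builder_smoke",
    output_dir: str = "acr-engine/data/pgvector_eval/music20",
) -> List[str]:
    """Return the argv list for the live directory pipeline script."""
    return [
        PYTHON, SCRIPT,
        "--dsn", dsn,
        "--schema", schema,
        "--input-root", input_root,
        "--output-dir", output_dir,
    ]


def run_pipeline(workdir: str = "/workspace") -> None:
    """Run the pipeline, reading the DSN from ACR_DSN (an assumed variable)."""
    dsn = os.environ["ACR_DSN"]
    # check=True raises CalledProcessError if the script exits non-zero.
    subprocess.run(build_pipeline_command(dsn), cwd=workdir, check=True)
```

Passing the arguments as a list avoids shell quoting problems with the DSN, and reading it from the environment keeps the password out of shell history.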

Current fresh evidence:

  • song_count = 2
  • asset_count = 2
  • window_count = 5
  • matcher_fingerprint_count = 5
  • fallback_fingerprint_count = 0
  • semantic_runtime_available = true
  • semantic_runtime_ready_count = 5
  • semantic_fallback_count = 0
  • import_counts.feature_fact = 34
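
The counters above can be checked mechanically after a run. The sketch below assumes the pipeline summary is available as a dictionary with these keys and that import_counts is a nested mapping; both the layout and the invariants (no fallbacks, one fingerprint and one ready semantic feature per window) are inferred from the numbers listed here, not from the pipeline code. check_evidence is a hypothetical helper.

```python
from typing import Any, Dict, List

# Values copied from the evidence list above.
EXAMPLE_SUMMARY: Dict[str, Any] = {
    "song_count": 2,
    "asset_count": 2,
    "window_count": 5,
    "matcher_fingerprint_count": 5,
    "fallback_fingerprint_count": 0,
    "semantic_runtime_available": True,
    "semantic_runtime_ready_count": 5,
    "semantic_fallback_count": 0,
    "import_counts": {"feature_fact": 34},
}


def check_evidence(summary: Dict[str, Any]) -> List[str]:
    """Return a list of human-readable problems; empty means healthy."""
    problems: List[str] = []
    windows = summary.get("window_count", 0)
    if windows <= 0:
        problems.append("no windows were produced")
    if summary.get("matcher_fingerprint_count") != windows:
        problems.append("matcher fingerprints do not cover every window")
    if summary.get("fallback_fingerprint_count", 0) != 0:
        problems.append("fingerprint fallback was used")
    if not summary.get("semantic_runtime_available", False):
        problems.append("semantic runtime is unavailable")
    if summary.get("semantic_runtime_ready_count") != windows:
        problems.append("semantic features do not cover every window")
    if summary.get("semantic_fallback_count", 0) != 0:
        problems.append("semantic fallback was used")
    if summary.get("import_counts", {}).get("feature_fact", 0) <= 0:
        problems.append("no feature_fact rows were imported")
    return problems
```

Calling check_evidence(EXAMPLE_SUMMARY) returns an empty list; any non-empty result points at the counter that drifted.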

4. What each retained document covers


5. Docs maintenance command

/usr/local/miniconda3/bin/python /workspace/scripts/check_markdown_links.py --root /workspace/docs