Refresh session handoff for the current voice service runtime state
Constraint: the handoff must reflect the real runtime state: health endpoints work, CPU torch is installed, but end-to-end voice smoke still times out Rejected: keeping the older dependency-missing note | it no longer matches the current environment and would mislead the next session Confidence: high Scope-risk: narrow Directive: keep handoff notes focused on the shortest next debugging path for /recognize/voice timeout reduction Tested: /usr/local/miniconda3/bin/python -m unittest discover -s acr-engine/tests -v; /health endpoint returns ok under uvicorn with CPU torch installed Not-tested: successful end-to-end /recognize/voice payload within timeout
Showing
1 changed file
with
23 additions
and
1 deletions
| ... | @@ -30,7 +30,7 @@ | ... | @@ -30,7 +30,7 @@ |
| 30 | - `acr-engine/src/service/app.py` 已新增 `POST /recognize/voice` | 30 | - `acr-engine/src/service/app.py` 已新增 `POST /recognize/voice` |
| 31 | - `/health` 可正常启动并返回 `ok` | 31 | - `/health` 可正常启动并返回 `ok` |
| 32 | - architect review: approved with watch;当前 split(本地 FAISS / 可选 ChromaDB / 生产 pgvector)方向成立 | 32 | - architect review: approved with watch;当前 split(本地 FAISS / 可选 ChromaDB / 生产 pgvector)方向成立 |
| 33 | - 当前 `POST /recognize/voice` 仍需完整 smoke 复核;此前明确阻塞点是 `torch` 缺失,现已补 CPU 版 `torch`,当前剩余现象是 voice 端到端 smoke 超时,需要继续收敛推理耗时或缓存策略 | 33 | - 当前 `POST /recognize/voice` 已跨过依赖缺失阶段:CPU 版 `torch` 已安装、`uvicorn` / `fastapi` / `python-multipart` 已安装、`/health` 可返回 `ok`;当前剩余现象是 voice 端到端 smoke 仍超时,需要继续收敛推理耗时、chunk 数量或缓存策略 |
| 34 | - 当前 docs 已做第一轮简化: | 34 | - 当前 docs 已做第一轮简化: |
| 35 | - `docs/README.md` 只保留最新架构与最短阅读顺序 | 35 | - `docs/README.md` 只保留最新架构与最短阅读顺序 |
| 36 | 36 | ||
| ... | @@ -46,6 +46,28 @@ | ... | @@ -46,6 +46,28 @@ |
| 46 | 3. 把哼唱评测集接入 `evaluate.py` 或独立评测脚本 | 46 | 3. 把哼唱评测集接入 `evaluate.py` 或独立评测脚本 |
| 47 | 4. 继续做 docs 第二轮收敛,只保留当前有效主文档 | 47 | 4. 继续做 docs 第二轮收敛,只保留当前有效主文档 |
| 48 | 48 | ||
| 49 | ### 最新补充(2026-06-03 voice service runtime) | ||
| 50 | |||
| 51 | - 已确认当前解释器 `/usr/local/miniconda3/bin/python` 下: | ||
| 52 | - `torch==2.3.1+cpu` | ||
| 53 | - `uvicorn==0.48.0` | ||
| 54 | - `fastapi==0.136.3` | ||
| 55 | - `python-multipart==0.0.30` | ||
| 56 | - `acr-engine/src/service/app.py` 当前可启动并通过: | ||
| 57 | - `GET /health` | ||
| 58 | - `GET /ready` | ||
| 59 | - `POST /recognize/voice` 当前不再报缺依赖,但端到端 smoke 仍会超时。 | ||
| 60 | - 当前最可能的下一步排查点: | ||
| 61 | 1. `voice_to_chunks` 默认切出的 chunk 数过多(当前样例可到 9 个) | ||
| 62 | 2. 首次 `_load_engine()` + `engine.recognize()` CPU 推理耗时过长 | ||
| 63 | 3. `context_exporter` 在每个候选上再次做 reference 滑窗,进一步拉长总耗时 | ||
| 64 | - 下一 session 建议直接从: | ||
| 65 | - `acr-engine/src/service/app.py` | ||
| 66 | - `acr-engine/scripts/service_voice_smoke.py` | ||
| 67 | - `acr-engine/src/data/voice_chunker.py` | ||
| 68 | - `acr-engine/src/utils/context_exporter.py` | ||
| 69 | 继续收敛超时问题。 | ||
| 70 | |||
| 49 | ### 最新交付快照(2026-06-02 16:12 UTC) | 71 | ### 最新交付快照(2026-06-02 16:12 UTC) |
| 50 | 72 | ||
| 51 | - 当前状态:先交付,后续重启继续 | 73 | - 当前状态:先交付,后续重启继续 | ... | ... |
-
Please register or sign in to post a comment