Commit 7c54eb28 7c54eb286d2d74d66e55e9bb44d6677ed13353c7 by cnb.bofCdSsphPA

Lock in a stable official route for real FMA archive acquisition

Constraint: Real-data progress was blocked until we could prove an upstream archive path that still works today
Rejected: Continue iterating on historical per-track URLs | Those paths already proved unstable via 403 and 404 evidence
Confidence: high
Scope-risk: narrow
Directive: Prefer the verified fma_small.zip archive route over legacy page or single-track scraping paths unless upstream changes again
Tested: curl -I -L --max-time 60 https://os.unil.cloud.switch.ch/fma/fma_small.zip; curl -L --range 0-1023 --max-time 60 -o /tmp/fma_small_probe.bin https://os.unil.cloud.switch.ch/fma/fma_small.zip
Not-tested: Full 7.68 GB archive download, extraction, and smoke execution remain pending
1 parent 7ea9b1d0
...@@ -225,6 +225,27 @@ ...@@ -225,6 +225,27 @@
225 225
226 226
227 227
228
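229 ### Stage: Confirming the official full-archive download path for FMA
230
231 Completed:
232 - Further verified a stable archive download path for FMA Small
233 - Confirmed that `fma_small.zip` is directly accessible at the official archive URL
234 - Added this path to the open-data workflow and handoff documents
235
236 Verification results:
237 - `curl -I -L --max-time 60 https://os.unil.cloud.switch.ch/fma/fma_small.zip` succeeded
238 - Key response headers at the time of the check: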
239 - `200 OK`
240 - `Content-Type: application/zip`
241 - `Content-Length: 7679594875`
242 - `Accept-Ranges: bytes`
243 - `curl -L --range 0-1023 ...` successfully fetched the first `1024` bytes
244
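The two probes above could also be reproduced from Python, for example as a pre-flight check before a full download. The following is a minimal sketch using only the standard library. The function names (`summarize_headers`, `probe_head`, `probe_range`) are hypothetical and not part of `fetch_fma_subset.py`; only the URL and the header values come from the verification notes.

```python
import urllib.request

# URL taken from the verification notes above.
FMA_SMALL_URL = "https://os.unil.cloud.switch.ch/fma/fma_small.zip"


def summarize_headers(status, headers):
    """Reduce a response to the fields recorded in the verification notes.

    Hypothetical helper: `headers` is any mapping of header name to value.
    """
    lowered = {name.lower(): value for name, value in headers.items()}
    length = int(lowered.get("content-length", 0))
    return {
        "ok": status == 200,
        "content_type": lowered.get("content-type"),
        "size_gb": round(length / 1e9, 2),  # decimal gigabytes
        "range_supported": lowered.get("accept-ranges", "").lower() == "bytes",
    }


def probe_head(url=FMA_SMALL_URL, timeout=60):
    """Send a HEAD request, roughly what `curl -I` does."""
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return summarize_headers(response.status, dict(response.headers.items()))


def probe_range(url=FMA_SMALL_URL, first=0, last=1023, timeout=60):
    """Fetch a small byte range, roughly what `curl --range 0-1023` does.

    A server that honors the Range header normally answers 206 Partial Content.
    """
    request = urllib.request.Request(url, headers={"Range": f"bytes={first}-{last}"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status, response.read()


if __name__ == "__main__":
    print(probe_head())
    status, body = probe_range()
    print(status, len(body))
```

Keeping `summarize_headers` free of network calls means the header check can be exercised offline against the recorded values, while the two `probe_*` functions remain thin wrappers that depend on the upstream server still behaving as it did during verification.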
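245 Conclusion:
246 - The FMA real-data effort is no longer missing a download URL
247 - The remaining question has shifted from "no stable URL can be found" to "whether to start actually pulling the 7.68 GB archive and writing it to disk"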
248
228 ### Stage: FMA downloader module invocation fix 249 ### Stage: FMA downloader module invocation fix
229 250
230 Completed: 251 Completed:
......
...@@ -276,7 +276,7 @@ ...@@ -276,7 +276,7 @@
276 - [docs/session-handoff.md](./session-handoff.md) 276 - [docs/session-handoff.md](./session-handoff.md)
277 - [docs/current-capability-map.md](./current-capability-map.md) 277 - [docs/current-capability-map.md](./current-capability-map.md)
278 - [acr-engine/FIRST_RUN_CHECKLIST.md](../acr-engine/FIRST_RUN_CHECKLIST.md) 278 - [acr-engine/FIRST_RUN_CHECKLIST.md](../acr-engine/FIRST_RUN_CHECKLIST.md)
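279 - A download scaffold for the real FMA subset already exists: [acr-engine/scripts/fetch_fma_subset.py](../acr-engine/scripts/fetch_fma_subset.py); the most recent verification found that the old direct links return `403`; the call was then fixed to use `python -m yt_dlp`, but the page-level historical URLs now return `404` 279 - A download scaffold for the real FMA subset already exists: [acr-engine/scripts/fetch_fma_subset.py](../acr-engine/scripts/fetch_fma_subset.py); the most recent verification found that the old direct links return `403` and the page-level historical URLs return `404`, but `https://os.unil.cloud.switch.ch/fma/fma_small.zip` has been verified to return `200 OK` and to support range requests
280 - Run [acr-engine/scripts/status_snapshot.py](../acr-engine/scripts/status_snapshot.py) 280 - Run [acr-engine/scripts/status_snapshot.py](../acr-engine/scripts/status_snapshot.py)
281 - Or inspect the latest on-disk snapshot directly: `acr-engine/.omx/latest_status_snapshot.json` 281 - Or inspect the latest on-disk snapshot directly: `acr-engine/.omx/latest_status_snapshot.json`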
282 282
......