Commit 55dea0c9 55dea0c93f070946b0199e21b55680c88f59ba2e by cnb.bofCdSsphPA

Record the live FMA download gate before real-data validation

Constraint: Real-data smoke cannot be claimed before the user-provided archive is fully downloaded and locally inspectable.
Rejected: Pretending readiness from partial bytes | That would create false verification evidence for the dataset lane.
Confidence: high
Scope-risk: narrow
Directive: Do not run real FMA extraction or smoke until inspect reports the full expected archive size.
Tested: Re-ran the archive inspect command and confirmed the active background curl process plus current local file size.
Not-tested: Extraction, local preparation, and real FMA smoke remain pending until the archive completes.
1 parent a4c891da
......@@ -2,6 +2,28 @@
## 2026-06-02
### Stage: 真实 FMA 下载状态续验
完成项:
- 复检用户指定 FMA 源下载状态:`https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 复检后台下载进程与本地归档体积
- 确认当前仍未达到可解压/可真实 smoke 的完成门槛
验证结果:
- `/usr/local/miniconda3/bin/python scripts/prepare_fma_archive.py inspect` 返回:
- `archive_bytes_expected=7679594875`
- `archive_size=3117514752`
- `archive_progress_percent=40.5948`
- `num_audio_files=0`
- 后台下载进程仍存活:
- `curl -L --continue-at - --output data/raw/fma_small.zip https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 当前本地归档文件:
- `data/raw/fma_small.zip = 3.0G`
结论:
- 当前主卡点仍是 FMA 归档未完整下载
- 真实 FMA 的解压、prepare、smoke-local 需要等待归档达到完整体积后继续
### Stage: 训练数据与 pgvector 专项说明补强
完成项:
......@@ -841,6 +863,28 @@
## 2026-06-02
### Stage: 真实 FMA 下载状态续验
完成项:
- 复检用户指定 FMA 源下载状态:`https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 复检后台下载进程与本地归档体积
- 确认当前仍未达到可解压/可真实 smoke 的完成门槛
验证结果:
- `/usr/local/miniconda3/bin/python scripts/prepare_fma_archive.py inspect` 返回:
- `archive_bytes_expected=7679594875`
- `archive_size=3117514752`
- `archive_progress_percent=40.5948`
- `num_audio_files=0`
- 后台下载进程仍存活:
- `curl -L --continue-at - --output data/raw/fma_small.zip https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 当前本地归档文件:
- `data/raw/fma_small.zip = 3.0G`
结论:
- 当前主卡点仍是 FMA 归档未完整下载
- 真实 FMA 的解压、prepare、smoke-local 需要等待归档达到完整体积后继续
### Stage: 训练数据与 pgvector 专项说明补强
完成项:
......@@ -890,6 +934,28 @@
## 2026-06-02
### Stage: 真实 FMA 下载状态续验
完成项:
- 复检用户指定 FMA 源下载状态:`https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 复检后台下载进程与本地归档体积
- 确认当前仍未达到可解压/可真实 smoke 的完成门槛
验证结果:
- `/usr/local/miniconda3/bin/python scripts/prepare_fma_archive.py inspect` 返回:
- `archive_bytes_expected=7679594875`
- `archive_size=3117514752`
- `archive_progress_percent=40.5948`
- `num_audio_files=0`
- 后台下载进程仍存活:
- `curl -L --continue-at - --output data/raw/fma_small.zip https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 当前本地归档文件:
- `data/raw/fma_small.zip = 3.0G`
结论:
- 当前主卡点仍是 FMA 归档未完整下载
- 真实 FMA 的解压、prepare、smoke-local 需要等待归档达到完整体积后继续
### Stage: 训练数据与 pgvector 专项说明补强
完成项:
......@@ -949,6 +1015,28 @@
## 2026-06-02
### Stage: 真实 FMA 下载状态续验
完成项:
- 复检用户指定 FMA 源下载状态:`https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 复检后台下载进程与本地归档体积
- 确认当前仍未达到可解压/可真实 smoke 的完成门槛
验证结果:
- `/usr/local/miniconda3/bin/python scripts/prepare_fma_archive.py inspect` 返回:
- `archive_bytes_expected=7679594875`
- `archive_size=3117514752`
- `archive_progress_percent=40.5948`
- `num_audio_files=0`
- 后台下载进程仍存活:
- `curl -L --continue-at - --output data/raw/fma_small.zip https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 当前本地归档文件:
- `data/raw/fma_small.zip = 3.0G`
结论:
- 当前主卡点仍是 FMA 归档未完整下载
- 真实 FMA 的解压、prepare、smoke-local 需要等待归档达到完整体积后继续
### Stage: 训练数据与 pgvector 专项说明补强
完成项:
......@@ -998,6 +1086,28 @@
## 2026-06-02
### Stage: 真实 FMA 下载状态续验
完成项:
- 复检用户指定 FMA 源下载状态:`https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 复检后台下载进程与本地归档体积
- 确认当前仍未达到可解压/可真实 smoke 的完成门槛
验证结果:
- `/usr/local/miniconda3/bin/python scripts/prepare_fma_archive.py inspect` 返回:
- `archive_bytes_expected=7679594875`
- `archive_size=3117514752`
- `archive_progress_percent=40.5948`
- `num_audio_files=0`
- 后台下载进程仍存活:
- `curl -L --continue-at - --output data/raw/fma_small.zip https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 当前本地归档文件:
- `data/raw/fma_small.zip = 3.0G`
结论:
- 当前主卡点仍是 FMA 归档未完整下载
- 真实 FMA 的解压、prepare、smoke-local 需要等待归档达到完整体积后继续
### Stage: 训练数据与 pgvector 专项说明补强
完成项:
......@@ -1045,6 +1155,28 @@
## 2026-06-02
### Stage: 真实 FMA 下载状态续验
完成项:
- 复检用户指定 FMA 源下载状态:`https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 复检后台下载进程与本地归档体积
- 确认当前仍未达到可解压/可真实 smoke 的完成门槛
验证结果:
- `/usr/local/miniconda3/bin/python scripts/prepare_fma_archive.py inspect` 返回:
- `archive_bytes_expected=7679594875`
- `archive_size=3117514752`
- `archive_progress_percent=40.5948`
- `num_audio_files=0`
- 后台下载进程仍存活:
- `curl -L --continue-at - --output data/raw/fma_small.zip https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 当前本地归档文件:
- `data/raw/fma_small.zip = 3.0G`
结论:
- 当前主卡点仍是 FMA 归档未完整下载
- 真实 FMA 的解压、prepare、smoke-local 需要等待归档达到完整体积后继续
### Stage: 训练数据与 pgvector 专项说明补强
完成项:
......@@ -1090,6 +1222,28 @@
## 2026-06-02
### Stage: 真实 FMA 下载状态续验
完成项:
- 复检用户指定 FMA 源下载状态:`https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 复检后台下载进程与本地归档体积
- 确认当前仍未达到可解压/可真实 smoke 的完成门槛
验证结果:
- `/usr/local/miniconda3/bin/python scripts/prepare_fma_archive.py inspect` 返回:
- `archive_bytes_expected=7679594875`
- `archive_size=3117514752`
- `archive_progress_percent=40.5948`
- `num_audio_files=0`
- 后台下载进程仍存活:
- `curl -L --continue-at - --output data/raw/fma_small.zip https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 当前本地归档文件:
- `data/raw/fma_small.zip = 3.0G`
结论:
- 当前主卡点仍是 FMA 归档未完整下载
- 真实 FMA 的解压、prepare、smoke-local 需要等待归档达到完整体积后继续
### Stage: 训练数据与 pgvector 专项说明补强
完成项:
......@@ -1140,6 +1294,28 @@
## 2026-06-02
### Stage: 真实 FMA 下载状态续验
完成项:
- 复检用户指定 FMA 源下载状态:`https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 复检后台下载进程与本地归档体积
- 确认当前仍未达到可解压/可真实 smoke 的完成门槛
验证结果:
- `/usr/local/miniconda3/bin/python scripts/prepare_fma_archive.py inspect` 返回:
- `archive_bytes_expected=7679594875`
- `archive_size=3117514752`
- `archive_progress_percent=40.5948`
- `num_audio_files=0`
- 后台下载进程仍存活:
- `curl -L --continue-at - --output data/raw/fma_small.zip https://modelscope.cn/datasets/pengzhendong/fma/resolve/master/fma_small.zip`
- 当前本地归档文件:
- `data/raw/fma_small.zip = 3.0G`
结论:
- 当前主卡点仍是 FMA 归档未完整下载
- 真实 FMA 的解压、prepare、smoke-local 需要等待归档达到完整体积后继续
### Stage: 训练数据与 pgvector 专项说明补强
完成项:
......