Preserve proof that cap64 hybrid advanced into evaluation before results landed
Constraint: The cap64 run is still incomplete, so only verified hybrid index-complete and evaluation-running evidence can be recorded safely now Rejected: Wait for hybrid eval.json before checkpointing | Would lose the verified handoff that hybrid indexing finished and evaluate.py is already running Confidence: high Scope-risk: narrow Directive: Keep cap64 high_energy and hybrid checkpoints symmetric so the final comparison can be written from docs alone if needed Tested: Verified hybrid reference_progress.json shows 64 refs, 657 windows, 192-d embeddings, and complete status; verified active process is evaluate.py on /tmp/ab_smoke_seg_cap64_top2/hybrid/fma/manifests; verified hybrid eval.json and report.json are still absent Not-tested: Final hybrid cap64 metrics, final report.json, and any cap64 winner conclusion
Showing
4 changed files
with
24 additions
and
3 deletions
| 1 | ## 2026-06-02 cap64 hybrid 索引完成并进入评测 checkpoint | ||
| 2 | |||
| 3 | 完成项: | ||
| 4 | - 已确认 cap64 的 `hybrid` reference index 构建完成。 | ||
| 5 | - 已确认流程从 `hybrid build-index` 推进到 `hybrid evaluate.py`。 | ||
| 6 | |||
| 7 | 验证证据: | ||
| 8 | - `hybrid/fma_index_smoke/reference_progress.json`: | ||
| 9 | - `status=complete` | ||
| 10 | - `refs_done=64` | ||
| 11 | - `windows_done=657` | ||
| 12 | - `embedding_shape=[657, 192]` | ||
| 13 | - `elapsed_sec=107.228` | ||
| 14 | - 进程树显示: | ||
| 15 | - `evaluate.py --data /tmp/ab_smoke_seg_cap64_top2/hybrid/fma/manifests ... --seed 42 --max-queries 32` | ||
| 16 | - 截至本 checkpoint: | ||
| 17 | - `hybrid eval.json` 尚未生成 | ||
| 18 | - 总 `report.json` 尚未生成 | ||
| 19 | |||
| 1 | ## 2026-06-02 cap64 hybrid 训练完成证据 checkpoint | 20 | ## 2026-06-02 cap64 hybrid 训练完成证据 checkpoint |
| 2 | 21 | ||
| 3 | 完成项: | 22 | 完成项: | ... | ... |
| ... | @@ -76,3 +76,5 @@ cd /workspace/acr-engine | ... | @@ -76,3 +76,5 @@ cd /workspace/acr-engine |
| 76 | - 已补充 cap64 阶段推进:`hybrid` 已完成训练并进入 build-index。 | 76 | - 已补充 cap64 阶段推进:`hybrid` 已完成训练并进入 build-index。 |
| 77 | 77 | ||
| 78 | - 已补充 cap64 新鲜证据:从运行会话确认 `hybrid` 的 `Epoch 1/1` 已完整跑完。 | 78 | - 已补充 cap64 新鲜证据:从运行会话确认 `hybrid` 的 `Epoch 1/1` 已完整跑完。 |
| 79 | |||
| 80 | - 已补充 cap64 新鲜证据:`hybrid` reference index 完成(`64 refs / 657 windows / 192-d`)并进入 `evaluate.py`。 | ... | ... |
| ... | @@ -61,5 +61,5 @@ test -f /tmp/ab_smoke_seg_cap48_top2_seed999/report.json && cat /tmp/ab_smoke_se | ... | @@ -61,5 +61,5 @@ test -f /tmp/ab_smoke_seg_cap48_top2_seed999/report.json && cat /tmp/ab_smoke_se |
| 61 | 61 | ||
| 62 | - 新 benchmark:`/tmp/ab_smoke_seg_cap64_top2` | 62 | - 新 benchmark:`/tmp/ab_smoke_seg_cap64_top2` |
| 63 | - 当前阶段:`high_energy` 已完成评测,结果为 `top1=0.625 / topk=1.0 / num_queries=32` | 63 | - 当前阶段:`high_energy` 已完成评测,结果为 `top1=0.625 / topk=1.0 / num_queries=32` |
| 64 | - 当前 `hybrid` 训练已完成,现处于 build-index 阶段 | 64 | - 当前 `hybrid` 索引已完成,现处于 evaluate 阶段 |
| 65 | - 下一 session 应优先检查 `hybrid` 结果与 `report.json` 是否生成 | 65 | - 下一 session 应优先检查 `hybrid` 结果与 `report.json` 是否生成 | ... | ... |
| ... | @@ -676,7 +676,7 @@ seed123 最终结论: | ... | @@ -676,7 +676,7 @@ seed123 最终结论: |
| 676 | - 配置:`subset_size=64`, `max_test_queries=32`, `seed=42` | 676 | - 配置:`subset_size=64`, `max_test_queries=32`, `seed=42` |
| 677 | - 当前最新证据: | 677 | - 当前最新证据: |
| 678 | - `high_energy` 已完成评测:`num_queries=32, top1=0.625, topk=1.0` | 678 | - `high_energy` 已完成评测:`num_queries=32, top1=0.625, topk=1.0` |
| 679 | - 已从运行会话确认 `hybrid` 的 `Epoch 1/1` 完整跑完(`32/32`) | 679 | - `hybrid` reference index 已完成:`64 refs / 657 windows / 192-d` |
| 680 | - 当前处于 `run_demo.py build-index` | 680 | - `hybrid` 当前已进入 `evaluate.py` |
| 681 | - 总 `report.json` 尚未生成 | 681 | - 总 `report.json` 尚未生成 |
| 682 | 682 | ... | ... |
-
Please register or sign in to post a comment