Capture another live FMA smoke progress checkpoint
Keep the restart artifacts synchronized with the newest observed elapsed time so the next session can see that the real FMA smoke is still advancing without yet reaching model save or evaluation stages. Constraint: Training remains inside Epoch 1, so verification is limited to live runtime evidence Rejected: Stop at the prior 17:07 checkpoint | would leave handoff docs behind the latest verified state Confidence: high Scope-risk: narrow Directive: Continue monitoring until the first saved model file or stage transition appears Tested: ps on PID 311629; validate-splits on /tmp/fma_real_smoke_stopcheck/fma/manifests; find on /tmp/fma_real_smoke_stopcheck/fma_models_smoke Not-tested: End-of-epoch artifacts, build-index, evaluate, final metrics
Showing
3 changed files
with
45 additions
and
0 deletions
-
Please register or sign in to post a comment