| Name |
Last Commit
81704ace
–
capture the first real-path index-to-evaluate closure\n\nConstraint: Delivery state must reflect fresh evaluate evidence without staging temporary eval assets\nRejected: Wait for larger-scale or hard-case metrics | The first explicit evaluate closure is already a meaningful milestone and restart-safe handoff point\nConfidence: high\nScope-risk: narrow\nDirective: Reuse /tmp/fma_realpath_small_rerun_index2 and /tmp/fma_realpath_small_rerun_eval as the next validation baseline before scaling up\nTested: Verified eval_top50.json at num_queries 35 with top1 0.8571 and topk 1.0, confirmed query-count explanation, and updated handoff/changelog docs\nNot-tested: Larger query caps, hard-case buckets, and full-scale FMA evaluate runs
|
History
|
Last Update |
|---|---|---|
| .. | ||
| ai-slop-cleaner | Loading commit data... | |
| analyze | Loading commit data... | |
| ask | Loading commit data... | |
| autopilot | Loading commit data... | |
| autoresearch | Loading commit data... | |
| autoresearch-goal | Loading commit data... | |
| best-practice-research | Loading commit data... | |
| cancel | Loading commit data... | |
| code-review | Loading commit data... | |
| configure-notifications | Loading commit data... | |
| deep-interview | Loading commit data... | |
| design | Loading commit data... | |
| doctor | Loading commit data... | |
| hud | Loading commit data... | |
| omx-setup | Loading commit data... | |
| performance-goal | Loading commit data... | |
| pipeline | Loading commit data... | |
| plan | Loading commit data... | |
| prometheus-strict | Loading commit data... | |
| ralph | Loading commit data... | |
| ralplan | Loading commit data... | |
| skill | Loading commit data... | |
| team | Loading commit data... | |
| ultragoal | Loading commit data... | |
| ultraqa | Loading commit data... | |
| ultrawork | Loading commit data... | |
| visual-ralph | Loading commit data... | |
| wiki | Loading commit data... | |
| worker | Loading commit data... |