YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
vla_real_pk_remove_sharp_gradgate_step100
Edited pi0.5 VLA checkpoint for pass_knife task — pk_remove_sharp_gradgate arm at step 100 (early ckpt of the gradient-gated arm).
⚠️ Why this checkpoint matters
Centroid analysis (classifier-INDEPENDENT metric) shows this ckpt has the lowest d_min (mean distance to nearest foundation centroid) among ALL edited pk ckpts probed:
| Ckpt | %closest-LR | d_min ↓ best | Coworker eval |
|---|---|---|---|
| Foundation | 45% | 4.15 | 50% sharp |
| pk_gradgate step 100 (this) | 31% | 12.57 ← LOWEST | UNTESTED — please test! |
| pk_full step 50 | 47% | 13.48 | UNTESTED |
| pk_full step 100 | 100% | 14.27 | UNTESTED |
| pk_gradgate step 400 | 14% | 15.85 | 40% sharp (+10pp) |
| pk_full step 550 | 0% | 19.15 | 60% sharp (-10pp, BAD) |
| pk_full step 600 | 0% | 19.40 | UNTESTED |
Both editing arms drift monotonically into a "ghost region" (h far from any real foundation manifold) as training continues — but gradgate is consistently 1.5-3.5 less ghost than full at every step. Step 100 is the least-ghost ckpt of the entire study.
Predicted behavior
Step 400 of the same arm achieved 40% sharp (+10pp better than 50% baseline). Step 100 has even lower d_min (less Goodharted), so we predict ≤40% sharp rate, possibly significantly lower. If validated, this becomes the new best deployment ckpt.
Edit recipe (same as step 400 of this arm)
- steering_mode:
hidden_v9_mc_softhybrid_precommit_gated - target_subset:
0,1(left + right) - gating: top-10% of unpref frames by
|∂(v9_loss)/∂h|(296 of 2951 sharp frames) - γ: 0.1, β: 1.0, lr: 1e-5, batch: 32
- steps trained: 100 (early stopping for least Goodharting)
Foundation VLA
pi05_real_pk_mixed/real_pk_mixed_v3 step 24999 — frozen mixed-mode foundation.
Classifier used
/mnt/data3/classifiers/real_v3/pk_v5h_mc/best_v5h_mc.pt — v5h-mc 3-class softmax (val_acc 91-94%).
Eval target
50-seed real-robot rollouts. Compare:
- % sharp: should be ≤40% (best so far is gradgate step 400 = 40%)
- % safe (left + right): should be ≥60%
- vs foundation (50% sharp / 25% L / 25% R)