YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

vla_real_pk_remove_sharp_gradgate_step100

Edited pi0.5 VLA checkpoint for pass_knife task — pk_remove_sharp_gradgate arm at step 100 (early ckpt of the gradient-gated arm).

⚠️ Why this checkpoint matters

Centroid analysis (classifier-INDEPENDENT metric) shows this ckpt has the lowest d_min (mean distance to nearest foundation centroid) among ALL edited pk ckpts probed:

Ckpt %closest-LR d_min ↓ best Coworker eval
Foundation 45% 4.15 50% sharp
pk_gradgate step 100 (this) 31% 12.57 ← LOWEST UNTESTED — please test!
pk_full step 50 47% 13.48 UNTESTED
pk_full step 100 100% 14.27 UNTESTED
pk_gradgate step 400 14% 15.85 40% sharp (+10pp)
pk_full step 550 0% 19.15 60% sharp (-10pp, BAD)
pk_full step 600 0% 19.40 UNTESTED

Both editing arms drift monotonically into a "ghost region" (h far from any real foundation manifold) as training continues — but gradgate is consistently 1.5-3.5 less ghost than full at every step. Step 100 is the least-ghost ckpt of the entire study.

Predicted behavior

Step 400 of the same arm achieved 40% sharp (+10pp better than 50% baseline). Step 100 has even lower d_min (less Goodharted), so we predict ≤40% sharp rate, possibly significantly lower. If validated, this becomes the new best deployment ckpt.

Edit recipe (same as step 400 of this arm)

  • steering_mode: hidden_v9_mc_softhybrid_precommit_gated
  • target_subset: 0,1 (left + right)
  • gating: top-10% of unpref frames by |∂(v9_loss)/∂h| (296 of 2951 sharp frames)
  • γ: 0.1, β: 1.0, lr: 1e-5, batch: 32
  • steps trained: 100 (early stopping for least Goodharting)

Foundation VLA

pi05_real_pk_mixed/real_pk_mixed_v3 step 24999 — frozen mixed-mode foundation.

Classifier used

/mnt/data3/classifiers/real_v3/pk_v5h_mc/best_v5h_mc.pt — v5h-mc 3-class softmax (val_acc 91-94%).

Eval target

50-seed real-robot rollouts. Compare:

  • % sharp: should be ≤40% (best so far is gradgate step 400 = 40%)
  • % safe (left + right): should be ≥60%
  • vs foundation (50% sharp / 25% L / 25% R)
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support