IDM-OMX-v3-50K

NVIDIA GR00T IDM (Inverse Dynamics Model) fine-tuned for ROBOTIS OMX-F robot.

Model Details

  • Architecture: SiGLIP2(768d) → VL Transformer(4L) → DiT(8L, 1024d)
  • Parameters: 612M
  • Embodiment: OMX (slot 30)
  • Action Space: 6-DOF delta joint angles (radians), 30 FPS
  • Action Horizon: 16 steps

Training

  • Base Model: IDM-OMX-v2 (20K steps checkpoint)
  • Dataset: 558 episodes, 257,466 frames (OMX combined + SO-101 megamix FK→IK converted)
  • Steps: 50,000 (6.21 epochs)
  • Batch Size: 16 (effective 32 with gradient accumulation)
  • Learning Rate: 1e-4 (cosine schedule)
  • Duration: 5h 08min on RTX 5090
  • Final Train Loss: 0.2644

Evaluation

Metric Baseline (10K) v2 (20K) v3 (50K)
Normalized MAE 0.6092 0.5582 0.3551 (-36.4%)
Forward Loss 0.3942 0.3453 0.2517 (-27.1%)

Per-Joint MAE (degrees)

Joint v2 v3 Change
shoulder_pan 82.12 71.76 -12.6%
shoulder_lift 83.01 72.74 -12.4%
elbow_flex 65.65 30.12 -54.1%
wrist_flex 141.40 96.43 -31.8%
wrist_roll 67.87 56.43 -16.9%
gripper 6.50 4.21 -35.2%

Usage

Pseudo-label Extraction

Citation

Based on NVIDIA GR00T IDM.

Downloads last month
-
Safetensors
Model size
0.6B params
Tensor type
F32
·
Video Preview
loading