| --- |
| license: apache-2.0 |
| tags: |
| - robotics |
| - inverse-dynamics-model |
| - idm |
| - omx |
| - gr00t |
| --- |
| |
| # IDM-OMX-v3-50K |
|
|
| NVIDIA GR00T IDM (Inverse Dynamics Model) fine-tuned for ROBOTIS OMX-F robot. |
|
|
| ## Model Details |
|
|
| - **Architecture**: SiGLIP2(768d) → VL Transformer(4L) → DiT(8L, 1024d) |
| - **Parameters**: 612M |
| - **Embodiment**: OMX (slot 30) |
| - **Action Space**: 6-DOF delta joint angles (radians), 30 FPS |
| - **Action Horizon**: 16 steps |
|
|
| ## Training |
|
|
| - **Base Model**: IDM-OMX-v2 (20K steps checkpoint) |
| - **Dataset**: 558 episodes, 257,466 frames (OMX combined + SO-101 megamix FK→IK converted) |
| - **Steps**: 50,000 (6.21 epochs) |
| - **Batch Size**: 16 (effective 32 with gradient accumulation) |
| - **Learning Rate**: 1e-4 (cosine schedule) |
| - **Duration**: 5h 08min on RTX 5090 |
| - **Final Train Loss**: 0.2644 |
|
|
| ## Evaluation |
|
|
| | Metric | Baseline (10K) | v2 (20K) | **v3 (50K)** | |
| |--------|:-:|:-:|:-:| |
| | Normalized MAE | 0.6092 | 0.5582 | **0.3551 (-36.4%)** | |
| | Forward Loss | 0.3942 | 0.3453 | **0.2517 (-27.1%)** | |
|
|
| ### Per-Joint MAE (degrees) |
|
|
| | Joint | v2 | v3 | Change | |
| |-------|---:|---:|---:| |
| | shoulder_pan | 82.12 | 71.76 | -12.6% | |
| | shoulder_lift | 83.01 | 72.74 | -12.4% | |
| | elbow_flex | 65.65 | 30.12 | **-54.1%** | |
| | wrist_flex | 141.40 | 96.43 | **-31.8%** | |
| | wrist_roll | 67.87 | 56.43 | -16.9% | |
| | gripper | 6.50 | 4.21 | -35.2% | |
| |
| ## Usage |
| |
| |
| |
| ### Pseudo-label Extraction |
| |
| |
| |
| ## Citation |
| |
| Based on [NVIDIA GR00T IDM](https://github.com/NVIDIA-Omniverse/GR00T-Dreams). |
| |