gpudad
/

groot-pick-place-cube-v1

+---
+license: apache-2.0
+tags:
+- robotics
+- manipulation
+- lerobot
+- groot
+- nvidia
+- lora
+- pick-and-place
+base_model: nvidia/GR00T-N1.5-3B
+datasets:
+- so101-safe-worker
+pipeline_tag: robotics
+---
+# GR00T Pick and Place Cube v1
+A fine-tuned **NVIDIA GR00T N1.5** model for robotic pick-and-place manipulation tasks.
+## Model Description
+This model was fine-tuned using **LoRA** (Low-Rank Adaptation) on the SO-101 robot arm dataset for cube pick-and-place tasks.
+### Training Details
+| Parameter | Value |
+|-----------|-------|
+| Base Model | nvidia/GR00T-N1.5-3B |
+| Fine-tuning Method | LoRA |
+| LoRA Rank | 64 |
+| LoRA Alpha | 16 |
+| Training Steps | 50,000 |
+| Batch Size | 8 |
+| Dataset | 21,557 episodes / 1.9M frames |
+| Task | Pick up cube and place in bin |
+| Cameras | Front + Wrist (128x128) |
+| Action Space | 4D (x, y, z, gripper) |
+### Performance
+- **Training Loss**: 1.17 → 0.12 (90% reduction)
+- **Evaluation Success Rate**: ~60% (with proper action unnormalization)
+## Usage
+```python
+from lerobot.policies.groot.modeling_groot import GrootPolicy
+# Load the model
+policy = GrootPolicy.from_pretrained("gpudad/groot-pick-place-cube-v1")
+policy.to("cuda")
+policy.eval()
+# Use for inference
+action = policy.select_action(observation_batch)
+```
+### Important: Action Unnormalization
+The model outputs actions in normalized [-1, 1] space. For the SO-101 robot:
+- XYZ: [-1, 1]
+- Gripper: needs mapping from [-1, 1] to [0, 2]
+```python
+# Unnormalize actions
+action_min = torch.tensor([-1, -1, -1, 0])
+action_max = torch.tensor([1, 1, 1, 2])
+unnormalized = (action + 1) / 2 * (action_max - action_min) + action_min
+```
+## Framework
+Trained using [LeRobot](https://github.com/huggingface/lerobot) 🤖
+## License
+Apache 2.0 (same as base GR00T model)