roco_model_act_2 / README.md
yjsm1203's picture
Upload folder using huggingface_hub
91802ec verified
---
license: mit
tags:
- robotics
- act
- vla
- gearbox-assembly
- imitation-learning
metrics:
- l1_loss
- kl_divergence
---
# ACT Policy for Gearbox Assembly (Filtered Demos)
This model is an **Action Chunking with Transformers (ACT)** policy trained to perform gearbox assembly tasks. It was trained using behavior cloning on a combination of `rocochallenge2025` and additional collected datasets.
## Model Details
- **Model Type**: ACT (Action Chunking with Transformers)
- **Policy Class**: ACT
- **Backbone**: ResNet-18
- **Training Dataset**: Integrated dataset (`rocochallenge2025` + `temp_new_dataset`) containing **241 episodes**.
- **Episode Length**: Fixed to **12,600 steps** (padded/truncated) to handle variable length recordings.
## Training Configuration
- **Task Name**: `sim_gearbox_assembly_demos_filtered`
- **Batch Size**: 32
- **Chunk Size (Action Horizon)**: 100
- **KL Weight**: 10
- **Hidden Dimension**: 512
- **Feedforward Dimension**: 3200
- **Learning Rate**: 1e-5
- **Num Epochs**: ~9500 (Early stopped/Interrupted)
- **Seed**: 0
## Inputs and Outputs
- **Observations**:
- `head_rgb` (240x320)
- `left_hand_rgb` (240x320)
- `right_hand_rgb` (240x320)
- `qpos` (14-dim joint positions)
- **Actions**:
- 14-dim combined action vector (7-dim left arm + 7-dim right arm)
## Usages
This model can be loaded using the `ACTPolicy` class. Ensure `dataset_stats.pkl` is loaded to normalize/unnormalize observations and actions correctly.
```python
from policy import ACTPolicy
import pickle
# Load stats
with open('dataset_stats.pkl', 'rb') as f:
stats = pickle.load(f)
# Load policy
policy = ACTPolicy(config)
policy.load_state_dict(torch.load('policy_best.ckpt'))
policy.cuda()
policy.eval()
```