ACT Policy for Gearbox Assembly (Filtered Demos)

This model is an Action Chunking with Transformers (ACT) policy trained to perform gearbox assembly tasks. It was trained using behavior cloning on a combination of rocochallenge2025 and additional collected datasets.

Model Details

Model Type: ACT (Action Chunking with Transformers)
Policy Class: ACT
Backbone: ResNet-18
Training Dataset: Integrated dataset (rocochallenge2025 + temp_new_dataset) containing 241 episodes.
Episode Length: Fixed to 12,600 steps (padded/truncated) to handle variable length recordings.

Training Configuration

Task Name: sim_gearbox_assembly_demos_filtered
Batch Size: 32
Chunk Size (Action Horizon): 100
KL Weight: 10
Hidden Dimension: 512
Feedforward Dimension: 3200
Learning Rate: 1e-5
Num Epochs: ~9500 (Early stopped/Interrupted)
Seed: 0

Inputs and Outputs

Observations:
- head_rgb (240x320)
- left_hand_rgb (240x320)
- right_hand_rgb (240x320)
- qpos (14-dim joint positions)
Actions:
- 14-dim combined action vector (7-dim left arm + 7-dim right arm)

Usages

This model can be loaded using the ACTPolicy class. Ensure dataset_stats.pkl is loaded to normalize/unnormalize observations and actions correctly.

from policy import ACTPolicy
import pickle

# Load stats
with open('dataset_stats.pkl', 'rb') as f:
    stats = pickle.load(f)

# Load policy
policy = ACTPolicy(config)
policy.load_state_dict(torch.load('policy_best.ckpt'))
policy.cuda()
policy.eval()

Downloads last month: -; Downloads are not tracked for this model. How to track

Video Preview

Robotics