ACT Policy for Gearbox Assembly (Filtered Demos)

This model is an Action Chunking with Transformers (ACT) policy trained to perform gearbox assembly tasks. It was trained using behavior cloning on a combination of rocochallenge2025 and additional collected datasets.

Model Details

  • Model Type: ACT (Action Chunking with Transformers)
  • Policy Class: ACT
  • Backbone: ResNet-18
  • Training Dataset: Integrated dataset (rocochallenge2025 + temp_new_dataset) containing 241 episodes.
  • Episode Length: Fixed to 12,600 steps (padded/truncated) to handle variable length recordings.

Training Configuration

  • Task Name: sim_gearbox_assembly_demos_filtered
  • Batch Size: 32
  • Chunk Size (Action Horizon): 100
  • KL Weight: 10
  • Hidden Dimension: 512
  • Feedforward Dimension: 3200
  • Learning Rate: 1e-5
  • Num Epochs: ~9500 (Early stopped/Interrupted)
  • Seed: 0

Inputs and Outputs

  • Observations:
    • head_rgb (240x320)
    • left_hand_rgb (240x320)
    • right_hand_rgb (240x320)
    • qpos (14-dim joint positions)
  • Actions:
    • 14-dim combined action vector (7-dim left arm + 7-dim right arm)

Usages

This model can be loaded using the ACTPolicy class. Ensure dataset_stats.pkl is loaded to normalize/unnormalize observations and actions correctly.

from policy import ACTPolicy
import pickle

# Load stats
with open('dataset_stats.pkl', 'rb') as f:
    stats = pickle.load(f)

# Load policy
policy = ACTPolicy(config)
policy.load_state_dict(torch.load('policy_best.ckpt'))
policy.cuda()
policy.eval()
Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading