yjsm1203 commited on
Commit
91802ec
·
verified ·
1 Parent(s): f4d4206

Upload folder using huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +63 -0
README.md ADDED
@@ -0,0 +1,63 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - robotics
5
+ - act
6
+ - vla
7
+ - gearbox-assembly
8
+ - imitation-learning
9
+ metrics:
10
+ - l1_loss
11
+ - kl_divergence
12
+ ---
13
+
14
+ # ACT Policy for Gearbox Assembly (Filtered Demos)
15
+
16
+ This model is an **Action Chunking with Transformers (ACT)** policy trained to perform gearbox assembly tasks. It was trained using behavior cloning on a combination of `rocochallenge2025` and additional collected datasets.
17
+
18
+ ## Model Details
19
+
20
+ - **Model Type**: ACT (Action Chunking with Transformers)
21
+ - **Policy Class**: ACT
22
+ - **Backbone**: ResNet-18
23
+ - **Training Dataset**: Integrated dataset (`rocochallenge2025` + `temp_new_dataset`) containing **241 episodes**.
24
+ - **Episode Length**: Fixed to **12,600 steps** (padded/truncated) to handle variable length recordings.
25
+
26
+ ## Training Configuration
27
+ - **Task Name**: `sim_gearbox_assembly_demos_filtered`
28
+ - **Batch Size**: 32
29
+ - **Chunk Size (Action Horizon)**: 100
30
+ - **KL Weight**: 10
31
+ - **Hidden Dimension**: 512
32
+ - **Feedforward Dimension**: 3200
33
+ - **Learning Rate**: 1e-5
34
+ - **Num Epochs**: ~9500 (Early stopped/Interrupted)
35
+ - **Seed**: 0
36
+
37
+ ## Inputs and Outputs
38
+ - **Observations**:
39
+ - `head_rgb` (240x320)
40
+ - `left_hand_rgb` (240x320)
41
+ - `right_hand_rgb` (240x320)
42
+ - `qpos` (14-dim joint positions)
43
+ - **Actions**:
44
+ - 14-dim combined action vector (7-dim left arm + 7-dim right arm)
45
+
46
+ ## Usages
47
+
48
+ This model can be loaded using the `ACTPolicy` class. Ensure `dataset_stats.pkl` is loaded to normalize/unnormalize observations and actions correctly.
49
+
50
+ ```python
51
+ from policy import ACTPolicy
52
+ import pickle
53
+
54
+ # Load stats
55
+ with open('dataset_stats.pkl', 'rb') as f:
56
+ stats = pickle.load(f)
57
+
58
+ # Load policy
59
+ policy = ACTPolicy(config)
60
+ policy.load_state_dict(torch.load('policy_best.ckpt'))
61
+ policy.cuda()
62
+ policy.eval()
63
+ ```