alfredgu001324 commited on
Commit
7bdebfa
Β·
verified Β·
1 Parent(s): 91161e1

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +91 -0
README.md ADDED
@@ -0,0 +1,91 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Sim2Sim2Sim Checkpoints
2
+
3
+ Official pretrained checkpoints for [Dynamics Distillation for Efficient and Transferable Control Learning](https://arxiv.org/abs/2605.01516).
4
+
5
+ ## Directory Structure
6
+
7
+ ```
8
+ dynamics_model/ # Stage 1: Learned dynamics models
9
+ β”œβ”€β”€ beamng_bicycle_*/ # Bicycle model variants
10
+ β”œβ”€β”€ beamng_ddm_*/ # Deep Dynamics Model (DDM) variants
11
+ β”œβ”€β”€ beamng_trans_*/ # Transformer-based models
12
+ β”œβ”€β”€ beamng_dytr_*/ # DYTR (Dynamics Transfer) models
13
+ └── beamng_manual_PID_*/ # Manual PID control baselines
14
+
15
+ control_policies/ # Stage 2 & 3: Trained RL policies
16
+ β”œβ”€β”€ PPO____*_bicycle/ # Policies trained on bicycle dynamics
17
+ β”œβ”€β”€ PPO____*_ddm/ # Policies trained on DDM dynamics
18
+ β”œβ”€β”€ PPO____*_trans/ # Policies trained on Transformer dynamics
19
+ β”œβ”€β”€ PPO____*_dytr_ddm/ # Policies trained on DYTR dynamics
20
+ └── PPO____*_oracle/ # Oracle policies (full-state observation)
21
+ ```
22
+
23
+ ## Usage
24
+
25
+ ### Download Specific Model
26
+
27
+ ```python
28
+ from huggingface_hub import hf_hub_download
29
+
30
+ # Download a dynamics model
31
+ dynamics_model = hf_hub_download(
32
+ repo_id="alfredgu001324/Sim2Sim2Sim",
33
+ filename="dynamics_model/beamng_trans_10/best_model.pt"
34
+ )
35
+
36
+ # Download a trained policy
37
+ policy = hf_hub_download(
38
+ repo_id="alfredgu001324/Sim2Sim2Sim",
39
+ filename="control_policies/PPO____R_80000__11_25_10_41_44_840_trans/model_PPO____R_80000__11_25_10_41_44_840_001280.pt"
40
+ )
41
+ ```
42
+
43
+ ### Batch Download
44
+
45
+ ```python
46
+ from huggingface_hub import snapshot_download
47
+
48
+ # Download all checkpoints
49
+ local_dir = snapshot_download(
50
+ repo_id="alfredgu001324/Sim2Sim2Sim",
51
+ cache_dir="./ckpts",
52
+ repo_type="model"
53
+ )
54
+ ```
55
+
56
+ ## Model Details
57
+
58
+ ### Dynamics Models (Stage 1)
59
+
60
+ - **Bicycle**: Simple analytical model serving as baseline
61
+ - **DDM**: Deep Dynamics Model - neural network trained on BeamNG data
62
+ - **Transformer**: Sequence-aware dynamics model using transformer architecture
63
+ - **DYTR**: Dynamics Transfer model for cross-domain generalization
64
+
65
+ ### Control Policies (Stage 2 & 3)
66
+
67
+ All policies trained using PPO with 80,000 environment steps:
68
+ - **trans**: Trained on Transformer dynamics model
69
+ - **ddm**: Trained on DDM dynamics model
70
+ - **dytr_ddm**: Trained on DYTR-wrapped DDM dynamics model
71
+ - **bicycle**: Trained on bicycle model
72
+ - **oracle**: Full-state observation policies
73
+ - **FF**: Feed-forward policies
74
+ - **trans_cond**: Transformer with condition encoding for surface changes
75
+
76
+ ## Citation
77
+
78
+ If you use these checkpoints, please cite the paper:
79
+
80
+ ```bibtex
81
+ @article{GuChittaEtAl2026,
82
+ author = {Gu, Xunjiang and Chitta, Kashyap and Golchoubian, Mahsa and Suplin, Vladimir and Gilitschenski, Igor},
83
+ title = {Dynamics Distillation for Efficient and Transferable Control Learning},
84
+ journal = {arXiv preprint arXiv:2605.01516},
85
+ year = {2026}
86
+ }
87
+ ```
88
+
89
+ ## License
90
+
91
+ Apache 2.0