adityabhaskara committed on
Commit 6070660 · verified · 1 Parent(s): 66c58b7

Upload folder using huggingface_hub

README.md ADDED
@@ -0,0 +1,207 @@
+ ---
+ base_model: nvidia/GR00T-N1.5-3B
+ library_name: peft
+ pipeline_tag: text-generation
+ tags:
+ - base_model:adapter:/home/jovyan/.cache/huggingface/hub/models--nvidia--GR00T-N1.5-3B/snapshots/869830fc749c35f34771aa5209f923ac57e4564e
+ - lora
+ - transformers
+ ---
+
+ # Model Card for Model ID
+
+ <!-- Provide a quick summary of what the model is/does. -->
+
+
+
+ ## Model Details
+
+ ### Model Description
+
+ <!-- Provide a longer summary of what this model is. -->
+
+
+
+ - **Developed by:** [More Information Needed]
+ - **Funded by [optional]:** [More Information Needed]
+ - **Shared by [optional]:** [More Information Needed]
+ - **Model type:** [More Information Needed]
+ - **Language(s) (NLP):** [More Information Needed]
+ - **License:** [More Information Needed]
+ - **Finetuned from model [optional]:** [More Information Needed]
+
+ ### Model Sources [optional]
+
+ <!-- Provide the basic links for the model. -->
+
+ - **Repository:** [More Information Needed]
+ - **Paper [optional]:** [More Information Needed]
+ - **Demo [optional]:** [More Information Needed]
+
+ ## Uses
+
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+
+ ### Direct Use
+
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+
+ [More Information Needed]
+
+ ### Downstream Use [optional]
+
+ <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+
+ [More Information Needed]
+
+ ### Out-of-Scope Use
+
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+
+ [More Information Needed]
+
+ ## Bias, Risks, and Limitations
+
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
+
+ [More Information Needed]
+
+ ### Recommendations
+
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+
+ ## How to Get Started with the Model
+
+ Use the code below to get started with the model.
+
+ [More Information Needed]
+
+ ## Training Details
+
+ ### Training Data
+
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+
+ [More Information Needed]
+
+ ### Training Procedure
+
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+
+ #### Preprocessing [optional]
+
+ [More Information Needed]
+
+
+ #### Training Hyperparameters
+
+ - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+
+ #### Speeds, Sizes, Times [optional]
+
+ <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+
+ [More Information Needed]
+
+ ## Evaluation
+
+ <!-- This section describes the evaluation protocols and provides the results. -->
+
+ ### Testing Data, Factors & Metrics
+
+ #### Testing Data
+
+ <!-- This should link to a Dataset Card if possible. -->
+
+ [More Information Needed]
+
+ #### Factors
+
+ <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+
+ [More Information Needed]
+
+ #### Metrics
+
+ <!-- These are the evaluation metrics being used, ideally with a description of why. -->
+
+ [More Information Needed]
+
+ ### Results
+
+ [More Information Needed]
+
+ #### Summary
+
+
+
+ ## Model Examination [optional]
+
+ <!-- Relevant interpretability work for the model goes here -->
+
+ [More Information Needed]
+
+ ## Environmental Impact
+
+ <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+
+ Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+
+ - **Hardware Type:** [More Information Needed]
+ - **Hours used:** [More Information Needed]
+ - **Cloud Provider:** [More Information Needed]
+ - **Compute Region:** [More Information Needed]
+ - **Carbon Emitted:** [More Information Needed]
+
+ ## Technical Specifications [optional]
+
+ ### Model Architecture and Objective
+
+ [More Information Needed]
+
+ ### Compute Infrastructure
+
+ [More Information Needed]
+
+ #### Hardware
+
+ [More Information Needed]
+
+ #### Software
+
+ [More Information Needed]
+
+ ## Citation [optional]
+
+ <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+
+ **BibTeX:**
+
+ [More Information Needed]
+
+ **APA:**
+
+ [More Information Needed]
+
+ ## Glossary [optional]
+
+ <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+
+ [More Information Needed]
+
+ ## More Information [optional]
+
+ [More Information Needed]
+
+ ## Model Card Authors [optional]
+
+ [More Information Needed]
+
+ ## Model Card Contact
+
+ [More Information Needed]
+ ### Framework versions
+
+ - PEFT 0.17.0
adapter_config.json ADDED
@@ -0,0 +1,38 @@
+ {
+ "alpha_pattern": {},
+ "auto_mapping": null,
+ "base_model_name_or_path": "/home/jovyan/.cache/huggingface/hub/models--nvidia--GR00T-N1.5-3B/snapshots/869830fc749c35f34771aa5209f923ac57e4564e",
+ "bias": "none",
+ "corda_config": null,
+ "eva_config": null,
+ "exclude_modules": null,
+ "fan_in_fan_out": false,
+ "inference_mode": true,
+ "init_lora_weights": true,
+ "layer_replication": null,
+ "layers_pattern": null,
+ "layers_to_transform": null,
+ "loftq_config": {},
+ "lora_alpha": 16,
+ "lora_bias": false,
+ "lora_dropout": 0.1,
+ "megatron_config": null,
+ "megatron_core": "megatron.core",
+ "modules_to_save": null,
+ "peft_type": "LORA",
+ "qalora_group_size": 16,
+ "r": 64,
+ "rank_pattern": {},
+ "revision": null,
+ "target_modules": [
+ "to_k",
+ "to_v",
+ "to_q"
+ ],
+ "target_parameters": null,
+ "task_type": "CAUSAL_LM",
+ "trainable_token_indices": null,
+ "use_dora": false,
+ "use_qalora": false,
+ "use_rslora": false
+ }
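A minimal sketch of what the LoRA hyperparameters in this config imply, assuming the standard LoRA formulation in which the low-rank update `B @ A` is scaled by `lora_alpha / r` (or by `lora_alpha / sqrt(r)` when `use_rslora` is true). The helper names `lora_scaling` and `lora_param_count` are illustrative and not part of PEFT; the layer sizes passed to `lora_param_count` are placeholders, since the diff does not state the dimensions of the `to_q`/`to_k`/`to_v` projections.

```python
import json
import math

# Values copied from the adapter_config.json diff above; unrelated keys omitted.
ADAPTER_CONFIG = json.loads("""
{
  "peft_type": "LORA",
  "r": 64,
  "lora_alpha": 16,
  "lora_dropout": 0.1,
  "use_rslora": false,
  "target_modules": ["to_k", "to_v", "to_q"]
}
""")


def lora_scaling(cfg: dict) -> float:
    """Scale factor applied to the low-rank update B @ A (illustrative helper)."""
    if cfg.get("use_rslora", False):
        # Rank-stabilized LoRA divides by sqrt(r) instead of r.
        return cfg["lora_alpha"] / math.sqrt(cfg["r"])
    return cfg["lora_alpha"] / cfg["r"]


def lora_param_count(in_features: int, out_features: int, r: int) -> int:
    """Trainable parameters added to one linear layer: A is r x in, B is out x r."""
    return r * (in_features + out_features)


scaling = lora_scaling(ADAPTER_CONFIG)
print(f"scaling = {scaling}")

# Placeholder layer size, NOT taken from the model: a hypothetical 1024 -> 1024 projection.
print(lora_param_count(1024, 1024, ADAPTER_CONFIG["r"]))
```

With `r = 64` and `lora_alpha = 16`, the update is scaled by 0.25, so the adapter contributes a quarter of the raw `B @ A` product to each targeted projection.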
adapter_model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:865ad794ecb6c96d4f1978a4c3942bc31d14eca7ab086fbc46e3ffe73b4e0a8c
+ size 52447352
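The three added lines are a Git LFS pointer, not the weights themselves: they record the spec version, the SHA-256 of the real file, and its size in bytes. As a rough sketch (the function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, not part of any library), a pointer can be parsed and used to check a downloaded blob like this:

```python
import hashlib

# Pointer text copied from the adapter_model.safetensors diff above.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:865ad794ecb6c96d4f1978a4c3942bc31d14eca7ab086fbc46e3ffe73b4e0a8c
size 52447352
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer into a dict (illustrative helper)."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    hash_algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Check that a blob has the size and SHA-256 digest the pointer promises."""
    return (
        len(data) == pointer["size_bytes"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["hash_algo"], pointer["size_bytes"])
print(f"{pointer['size_bytes'] / 2**20:.2f} MiB")
```

The same parsing applies to the `optimizer.pt`, `rng_state.pth`, and `scheduler.pt` pointers further down in this commit.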
experiment_cfg/metadata.json ADDED
@@ -0,0 +1,363 @@
+ {
+ "new_embodiment": {
+ "statistics": {
+ "state": {
+ "left_arm": {
+ "max": [
+ 98.73976135253906,
+ 96.35901641845703,
+ 100.0,
+ 99.56765747070312,
+ 45.55353927612305
+ ],
+ "min": [
+ -98.61373901367188,
+ -99.57752227783203,
+ -98.8387680053711,
+ -54.29303741455078,
+ -99.32589721679688
+ ],
+ "mean": [
+ -21.088638809273196,
+ -35.20693478486454,
+ 19.685823289968425,
+ 74.89012554533487,
+ -15.454275804740044
+ ],
+ "std": [
+ 22.468562623476963,
+ 50.70564450768566,
+ 60.6346817431209,
+ 25.006065320464476,
+ 16.093950236973185
+ ],
+ "q01": [
+ -56.86354312009074,
+ -96.16074863523554,
+ -79.80371922660588,
+ 6.545855459148827,
+ -49.08604811304966
+ ],
+ "q99": [
+ 19.026424908006206,
+ 51.89971311508096,
+ 94.18904620671552,
+ 97.54961751409965,
+ 6.3060794522607475
+ ]
+ },
+ "left_gripper": {
+ "max": [
+ 52.63508987426758
+ ],
+ "min": [
+ 0.0
+ ],
+ "mean": [
+ 7.5337724304084945
+ ],
+ "std": [
+ 9.11128214017158
+ ],
+ "q01": [
+ 1.0089721655563384
+ ],
+ "q99": [
+ 27.065848055626642
+ ]
+ },
+ "right_arm": {
+ "max": [
+ 99.48848724365234,
+ 91.86785125732422,
+ 100.0,
+ 100.0,
+ 98.98451232910156
+ ],
+ "min": [
+ -93.22250366210938,
+ -99.83000183105469,
+ -98.9276123046875,
+ -60.289730072021484,
+ -56.58796691894531
+ ],
+ "mean": [
+ 14.676079358105927,
+ -26.144114099317058,
+ 15.556059136017115,
+ 66.67901684164453,
+ 14.864458223747778
+ ],
+ "std": [
+ 26.08057703647342,
+ 62.66198528046179,
+ 73.63385319331137,
+ 23.845803446928073,
+ 21.03552330309383
+ ],
+ "q01": [
+ -28.967125469159907,
+ -97.51646477548387,
+ -89.09030375660329,
+ 7.580964851955197,
+ -14.60383211393018
+ ],
+ "q99": [
+ 62.9508381762599,
+ 61.420361127735305,
+ 96.21193922802121,
+ 98.01781307934789,
+ 58.466155534153685
+ ]
+ },
+ "right_gripper": {
+ "max": [
+ 73.18695831298828
+ ],
+ "min": [
+ 0.2661344110965729
+ ],
+ "mean": [
+ 11.251033519247272
+ ],
+ "std": [
+ 11.582784529582225
+ ],
+ "q01": [
+ 0.9866329932825489
+ ],
+ "q99": [
+ 35.70186523687404
+ ]
+ }
+ },
+ "action": {
+ "left_arm": {
+ "max": [
+ 100.0,
+ 95.20404052734375,
+ 100.0,
+ 100.0,
+ 45.40709686279297
+ ],
+ "min": [
+ -99.50093841552734,
+ -100.0,
+ -99.81965637207031,
+ -55.373329162597656,
+ -100.0
+ ],
+ "mean": [
+ -21.22910693284859,
+ -36.26338439868878,
+ 18.568807013449508,
+ 74.93586327634753,
+ -15.500467764663378
+ ],
+ "std": [
+ 22.52412827316075,
+ 49.920831151438485,
+ 61.12057736991418,
+ 25.354975731757932,
+ 16.12349195694414
+ ],
+ "q01": [
+ -57.361319609255304,
+ -96.24721178829788,
+ -81.85097660670694,
+ 5.673960330982467,
+ -49.26118271199973
+ ],
+ "q99": [
+ 19.103420917251402,
+ 50.15322887723065,
+ 93.82631644011035,
+ 98.25881611488697,
+ 6.342428805888878
+ ]
+ },
+ "left_gripper": {
+ "max": [
+ 52.85481262207031
+ ],
+ "min": [
+ 0.0
+ ],
+ "mean": [
+ 6.93280521894035
+ ],
+ "std": [
+ 9.316508574164935
+ ],
+ "q01": [
+ 0.645436191710222
+ ],
+ "q99": [
+ 27.23252384539723
+ ]
+ },
+ "right_arm": {
+ "max": [
+ 100.0,
+ 90.01692199707031,
+ 100.0,
+ 100.0,
+ 100.0
+ ],
+ "min": [
+ -94.36090087890625,
+ -100.0,
+ -100.0,
+ -61.23424530029297,
+ -56.99427795410156
+ ],
+ "mean": [
+ 14.896782436134972,
+ -27.36248154104448,
+ 14.162390888394311,
+ 66.7463894035268,
+ 14.970113458597439
+ ],
+ "std": [
+ 26.160950796180696,
+ 61.5732505204343,
+ 74.36142667953995,
+ 24.23392015074816,
+ 21.086168962595064
+ ],
+ "q01": [
+ -29.051383386787016,
+ -97.62820145871616,
+ -92.5892661880274,
+ 6.738305758200929,
+ -14.648052277776706
+ ],
+ "q99": [
+ 63.28248689076074,
+ 59.82874419933647,
+ 96.01684254797274,
+ 98.98339090186273,
+ 58.65582270083417
+ ]
+ },
+ "right_gripper": {
+ "max": [
+ 73.6632080078125
+ ],
+ "min": [
+ 0.0
+ ],
+ "mean": [
+ 10.531353202983311
+ ],
+ "std": [
+ 12.051799910833976
+ ],
+ "q01": [
+ 0.6764031830470845
+ ],
+ "q99": [
+ 35.97826615669552
+ ]
+ }
+ }
+ },
+ "modalities": {
+ "video": {
+ "top": {
+ "resolution": [
+ 640,
+ 480
+ ],
+ "channels": 3,
+ "fps": 30.0
+ },
+ "left_gripper": {
+ "resolution": [
+ 640,
+ 480
+ ],
+ "channels": 3,
+ "fps": 30.0
+ },
+ "right_gripper": {
+ "resolution": [
+ 640,
+ 480
+ ],
+ "channels": 3,
+ "fps": 30.0
+ }
+ },
+ "state": {
+ "left_arm": {
+ "absolute": true,
+ "rotation_type": null,
+ "shape": [
+ 5
+ ],
+ "continuous": true
+ },
+ "left_gripper": {
+ "absolute": true,
+ "rotation_type": null,
+ "shape": [
+ 1
+ ],
+ "continuous": true
+ },
+ "right_arm": {
+ "absolute": true,
+ "rotation_type": null,
+ "shape": [
+ 5
+ ],
+ "continuous": true
+ },
+ "right_gripper": {
+ "absolute": true,
+ "rotation_type": null,
+ "shape": [
+ 1
+ ],
+ "continuous": true
+ }
+ },
+ "action": {
+ "left_arm": {
+ "absolute": true,
+ "rotation_type": null,
+ "shape": [
+ 5
+ ],
+ "continuous": true
+ },
+ "left_gripper": {
+ "absolute": true,
+ "rotation_type": null,
+ "shape": [
+ 1
+ ],
+ "continuous": true
+ },
+ "right_arm": {
+ "absolute": true,
+ "rotation_type": null,
+ "shape": [
+ 5
+ ],
+ "continuous": true
+ },
+ "right_gripper": {
+ "absolute": true,
+ "rotation_type": null,
+ "shape": [
+ 1
+ ],
+ "continuous": true
+ }
+ }
+ },
+ "embodiment_tag": "new_embodiment"
+ }
+ }
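Per-key statistics like these are typically used to normalize state and action vectors before they reach the model and to map predictions back to robot units afterwards. The sketch below shows one plausible scheme, min-max scaling to [-1, 1], using the gripper state statistics from the metadata. Which normalization mode (min/max, mean/std, or q01/q99) this training run actually used is not stated in the diff, so the choice here is an assumption, and `min_max_normalize` / `min_max_denormalize` are hypothetical helper names.

```python
# Gripper state statistics copied from experiment_cfg/metadata.json above.
STATE_STATS = {
    "left_gripper": {"min": [0.0], "max": [52.63508987426758]},
    "right_gripper": {"min": [0.2661344110965729], "max": [73.18695831298828]},
}

# Per-key state shapes copied from the "modalities" -> "state" block above.
STATE_SHAPES = {"left_arm": 5, "left_gripper": 1, "right_arm": 5, "right_gripper": 1}
STATE_DIM = sum(STATE_SHAPES.values())  # size of the concatenated state vector


def min_max_normalize(values, lo, hi):
    """Map each value from [lo, hi] to [-1, 1]; constant dimensions map to 0 (assumed convention)."""
    out = []
    for v, a, b in zip(values, lo, hi):
        out.append(0.0 if b == a else 2.0 * (v - a) / (b - a) - 1.0)
    return out


def min_max_denormalize(values, lo, hi):
    """Inverse of min_max_normalize: map [-1, 1] back to [lo, hi]."""
    return [(v + 1.0) / 2.0 * (b - a) + a for v, a, b in zip(values, lo, hi)]


left = STATE_STATS["left_gripper"]
normalized = min_max_normalize([10.0], left["min"], left["max"])
restored = min_max_denormalize(normalized, left["min"], left["max"])
print(STATE_DIM, normalized, restored)
```

Swapping `min`/`max` for `q01`/`q99` in the same functions gives a quantile-based variant that is less sensitive to outliers, at the cost of values outside [-1, 1] for the extreme 2% of samples.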
optimizer.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:877ea8d3ca740110270065117db929935de653231dd4700a52c7c2bd4387d3d4
+ size 104959034
rng_state.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:1335be632b622c2ecd1d692d16eb7c7c0955745f9f8cd3c962167753b9a0c120
+ size 14244
scheduler.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8aa3b9a6760c74bf79a9f4eed6c4d72bf6f6fd792743442de74dce267eb170d2
+ size 1064
trainer_state.json ADDED
The diff for this file is too large to render. See raw diff