Commit 599db80 by deebak14 (verified)
1 Parent(s): af261d2

Upload model

Files changed (3)
  1. README.md +199 -0
  2. adapter_config.json +523 -0
  3. adapter_model.safetensors +3 -0
README.md ADDED
@@ -0,0 +1,199 @@
+ ---
+ library_name: transformers
+ tags: []
+ ---
+
+ # Model Card for Model ID
+
+ <!-- Provide a quick summary of what the model is/does. -->
+
+
+
+ ## Model Details
+
+ ### Model Description
+
+ <!-- Provide a longer summary of what this model is. -->
+
+ This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
+
+ - **Developed by:** [More Information Needed]
+ - **Funded by [optional]:** [More Information Needed]
+ - **Shared by [optional]:** [More Information Needed]
+ - **Model type:** [More Information Needed]
+ - **Language(s) (NLP):** [More Information Needed]
+ - **License:** [More Information Needed]
+ - **Finetuned from model [optional]:** [More Information Needed]
+
+ ### Model Sources [optional]
+
+ <!-- Provide the basic links for the model. -->
+
+ - **Repository:** [More Information Needed]
+ - **Paper [optional]:** [More Information Needed]
+ - **Demo [optional]:** [More Information Needed]
+
+ ## Uses
+
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+
+ ### Direct Use
+
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+
+ [More Information Needed]
+
+ ### Downstream Use [optional]
+
+ <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+
+ [More Information Needed]
+
+ ### Out-of-Scope Use
+
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+
+ [More Information Needed]
+
+ ## Bias, Risks, and Limitations
+
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
+
+ [More Information Needed]
+
+ ### Recommendations
+
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+
+ ## How to Get Started with the Model
+
+ Use the code below to get started with the model.
+
+ [More Information Needed]
+
+ ## Training Details
+
+ ### Training Data
+
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+
+ [More Information Needed]
+
+ ### Training Procedure
+
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+
+ #### Preprocessing [optional]
+
+ [More Information Needed]
+
+
+ #### Training Hyperparameters
+
+ - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+
+ #### Speeds, Sizes, Times [optional]
+
+ <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+
+ [More Information Needed]
+
+ ## Evaluation
+
+ <!-- This section describes the evaluation protocols and provides the results. -->
+
+ ### Testing Data, Factors & Metrics
+
+ #### Testing Data
+
+ <!-- This should link to a Dataset Card if possible. -->
+
+ [More Information Needed]
+
+ #### Factors
+
+ <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+
+ [More Information Needed]
+
+ #### Metrics
+
+ <!-- These are the evaluation metrics being used, ideally with a description of why. -->
+
+ [More Information Needed]
+
+ ### Results
+
+ [More Information Needed]
+
+ #### Summary
+
+
+
+ ## Model Examination [optional]
+
+ <!-- Relevant interpretability work for the model goes here -->
+
+ [More Information Needed]
+
+ ## Environmental Impact
+
+ <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+
+ Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+
+ - **Hardware Type:** [More Information Needed]
+ - **Hours used:** [More Information Needed]
+ - **Cloud Provider:** [More Information Needed]
+ - **Compute Region:** [More Information Needed]
+ - **Carbon Emitted:** [More Information Needed]
+
+ ## Technical Specifications [optional]
+
+ ### Model Architecture and Objective
+
+ [More Information Needed]
+
+ ### Compute Infrastructure
+
+ [More Information Needed]
+
+ #### Hardware
+
+ [More Information Needed]
+
+ #### Software
+
+ [More Information Needed]
+
+ ## Citation [optional]
+
+ <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+
+ **BibTeX:**
+
+ [More Information Needed]
+
+ **APA:**
+
+ [More Information Needed]
+
+ ## Glossary [optional]
+
+ <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+
+ [More Information Needed]
+
+ ## More Information [optional]
+
+ [More Information Needed]
+
+ ## Model Card Authors [optional]
+
+ [More Information Needed]
+
+ ## Model Card Contact
+
+ [More Information Needed]
adapter_config.json ADDED
@@ -0,0 +1,523 @@
+ {
+ "alpha_pattern": {},
+ "auto_mapping": null,
+ "base_model_name_or_path": "Qwen/Qwen2.5-Coder-14B-Instruct",
+ "bias": "none",
+ "corda_config": null,
+ "eva_config": null,
+ "exclude_modules": null,
+ "fan_in_fan_out": false,
+ "inference_mode": true,
+ "init_lora_weights": true,
+ "layer_replication": null,
+ "layers_pattern": null,
+ "layers_to_transform": null,
+ "loftq_config": {},
+ "lora_alpha": 64,
+ "lora_bias": false,
+ "lora_dropout": 0.01,
+ "megatron_config": null,
+ "megatron_core": "megatron.core",
+ "modules_to_save": [
+ "lm_head"
+ ],
+ "peft_type": "LORA",
+ "qalora_group_size": 16,
+ "r": 32,
+ "rank_pattern": {},
+ "revision": null,
+ "target_modules": [
+ "v_proj",
+ "down_proj",
+ "q_proj",
+ "o_proj",
+ "k_proj",
+ "up_proj",
+ "gate_proj"
+ ],
+ "task_type": "CAUSAL_LM",
+ "trainable_token_indices": {
+ "embed_tokens": [
+ 151665,
+ 151666,
+ 151667,
+ 151668,
+ 151669,
+ 151670,
+ 151671,
+ 151672,
+ 151673,
+ 151674,
+ 151675,
+ 151676,
+ 151677,
+ 151678,
+ 151679,
+ 151680,
+ 151681,
+ 151682,
+ 151683,
+ 151684,
+ 151685,
+ 151686,
+ 151687,
+ 151688,
+ 151689,
+ 151690,
+ 151691,
+ 151692,
+ 151693,
+ 151694,
+ 151695,
+ 151696,
+ 151697,
+ 151698,
+ 151699,
+ 151700,
+ 151701,
+ 151702,
+ 151703,
+ 151704,
+ 151705,
+ 151706,
+ 151707,
+ 151708,
+ 151709,
+ 151710,
+ 151711,
+ 151712,
+ 151713,
+ 151714,
+ 151715,
+ 151716,
+ 151717,
+ 151718,
+ 151719,
+ 151720,
+ 151721,
+ 151722,
+ 151723,
+ 151724,
+ 151725,
+ 151726,
+ 151727,
+ 151728,
+ 151729,
+ 151730,
+ 151731,
+ 151732,
+ 151733,
+ 151734,
+ 151735,
+ 151736,
+ 151737,
+ 151738,
+ 151739,
+ 151740,
+ 151741,
+ 151742,
+ 151743,
+ 151744,
+ 151745,
+ 151746,
+ 151747,
+ 151748,
+ 151749,
+ 151750,
+ 151751,
+ 151752,
+ 151753,
+ 151754,
+ 151755,
+ 151756,
+ 151757,
+ 151758,
+ 151759,
+ 151760,
+ 151761,
+ 151762,
+ 151763,
+ 151764,
+ 151765,
+ 151766,
+ 151767,
+ 151768,
+ 151769,
+ 151770,
+ 151771,
+ 151772,
+ 151773,
+ 151774,
+ 151775,
+ 151776,
+ 151777,
+ 151778,
+ 151779,
+ 151780,
+ 151781,
+ 151782,
+ 151783,
+ 151784,
+ 151785,
+ 151786,
+ 151787,
+ 151788,
+ 151789,
+ 151790,
+ 151791,
+ 8726,
+ 66040,
+ 151792,
+ 151793,
+ 151794,
+ 151795,
+ 151796,
+ 151797,
+ 151798,
+ 151799,
+ 151800,
+ 151801,
+ 151802,
+ 151803,
+ 151804,
+ 151805,
+ 151806,
+ 151807,
+ 151808,
+ 151809,
+ 151810,
+ 151811,
+ 151812,
+ 151813,
+ 151814,
+ 151815,
+ 151816,
+ 151817,
+ 151818,
+ 151819,
+ 151820,
+ 151821,
+ 151822,
+ 151823,
+ 151824,
+ 151825,
+ 151826,
+ 151827,
+ 151828,
+ 151829,
+ 151830,
+ 151831,
+ 151832,
+ 151833,
+ 151834,
+ 151835,
+ 151836,
+ 151837,
+ 151838,
+ 151839,
+ 151840,
+ 151841,
+ 151842,
+ 151843,
+ 151844,
+ 151845,
+ 151846,
+ 151847,
+ 151848,
+ 151849,
+ 151850,
+ 151851,
+ 151852,
+ 151853,
+ 151854,
+ 151855,
+ 151856,
+ 151857,
+ 151858,
+ 151859,
+ 151860,
+ 151861,
+ 151862,
+ 151863,
+ 151864,
+ 151865,
+ 151866,
+ 151867,
+ 151868,
+ 151869,
+ 151870,
+ 151871,
+ 151872,
+ 151873,
+ 151874,
+ 151875,
+ 151876,
+ 151877,
+ 151878,
+ 151879,
+ 151880,
+ 151881,
+ 151882,
+ 151883,
+ 151884,
+ 151885,
+ 151886,
+ 151887,
+ 151888,
+ 151889,
+ 151890,
+ 151891,
+ 151892,
+ 151893,
+ 151894,
+ 151895,
+ 151896,
+ 151897,
+ 151898,
+ 151899,
+ 151900,
+ 151901,
+ 151902,
+ 151903,
+ 151904,
+ 151905,
+ 151906,
+ 151907,
+ 151908,
+ 151909,
+ 151910,
+ 151911,
+ 151912,
+ 151913,
+ 151914,
+ 151915,
+ 151916,
+ 151917,
+ 151918,
+ 151919,
+ 151920,
+ 151921,
+ 151922,
+ 151923,
+ 151924,
+ 151925,
+ 151926,
+ 151927,
+ 151928,
+ 151929,
+ 151930,
+ 151931,
+ 151932,
+ 151933,
+ 151934,
+ 151935,
+ 151936,
+ 151937,
+ 151938,
+ 151939,
+ 151940,
+ 151941,
+ 151942,
+ 151943,
+ 151944,
+ 151945,
+ 151946,
+ 151947,
+ 151948,
+ 151949,
+ 151950,
+ 151951,
+ 151952,
+ 151953,
+ 151954,
+ 151955,
+ 151956,
+ 151957,
+ 151958,
+ 151959,
+ 151960,
+ 151961,
+ 151962,
+ 151963,
+ 151964,
+ 151965,
+ 151966,
+ 151967,
+ 151968,
+ 151969,
+ 151970,
+ 151971,
+ 151972,
+ 151973,
+ 151974,
+ 151975,
+ 151976,
+ 151977,
+ 151978,
+ 151979,
+ 151980,
+ 151981,
+ 151982,
+ 151983,
+ 151984,
+ 151985,
+ 151986,
+ 151987,
+ 151988,
+ 151989,
+ 151990,
+ 151991,
+ 151992,
+ 151993,
+ 151994,
+ 151995,
+ 151996,
+ 151997,
+ 151998,
+ 151999,
+ 152000,
+ 152001,
+ 152002,
+ 152003,
+ 152004,
+ 152005,
+ 152006,
+ 152007,
+ 152008,
+ 152009,
+ 15939,
+ 152010,
+ 152011,
+ 152012,
+ 152013,
+ 152014,
+ 152015,
+ 152016,
+ 152017,
+ 152018,
+ 152019,
+ 14778,
+ 152020,
+ 152021,
+ 152022,
+ 41745,
+ 152023,
+ 152024,
+ 152025,
+ 152026,
+ 152027,
+ 152028,
+ 152029,
+ 152030,
+ 152031,
+ 152032,
+ 152033,
+ 152034,
+ 152035,
+ 152036,
+ 152037,
+ 152038,
+ 152039,
+ 152040,
+ 152041,
+ 152042,
+ 152043,
+ 37748,
+ 152044,
+ 152045,
+ 152046,
+ 152047,
+ 49530,
+ 152048,
+ 152049,
+ 152050,
+ 152051,
+ 152052,
+ 152053,
+ 152054,
+ 152055,
+ 152056,
+ 152057,
+ 152058,
+ 152059,
+ 152060,
+ 152061,
+ 152062,
+ 152063,
+ 152064,
+ 152065,
+ 152066,
+ 152067,
+ 152068,
+ 152069,
+ 152070,
+ 152071,
+ 152072,
+ 152073,
+ 152074,
+ 152075,
+ 152076,
+ 152077,
+ 152078,
+ 152079,
+ 152080,
+ 152081,
+ 152082,
+ 152083,
+ 152084,
+ 152085,
+ 152086,
+ 152087,
+ 152088,
+ 152089,
+ 152090,
+ 152091,
+ 152092,
+ 152093,
+ 152094,
+ 152095,
+ 152096,
+ 152097,
+ 152098,
+ 152099,
+ 152100,
+ 152101,
+ 152102,
+ 152103,
+ 152104,
+ 152105,
+ 152106,
+ 152107,
+ 152108,
+ 152109,
+ 152110,
+ 152111,
+ 152112,
+ 152113,
+ 152114,
+ 152115,
+ 152116,
+ 152117,
+ 152118,
+ 152119,
+ 152120,
+ 152121,
+ 152122,
+ 152123,
+ 152124,
+ 152125,
+ 152126,
+ 152127,
+ 152128,
+ 152129,
+ 152130,
+ 152131,
+ 152132,
+ 152133,
+ 152134
+ ]
+ },
+ "use_dora": false,
+ "use_qalora": false,
+ "use_rslora": false
+ }
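
The adapter configuration above can be summarized with a few lines of standard-library Python. This is only an illustrative sketch: it embeds a shortened excerpt of the config (the `embed_tokens` list is cut down to four of the IDs shown above), and the helper name `summarize_lora_config` is hypothetical. It assumes the usual LoRA convention that the low-rank update is scaled by `lora_alpha / r` when `use_rslora` is false, and by `lora_alpha / sqrt(r)` when it is true.

```python
import json
import math

# Shortened excerpt of the adapter_config.json shown above; the real
# "embed_tokens" list has many more entries and is abbreviated here.
CONFIG_EXCERPT = """
{
  "base_model_name_or_path": "Qwen/Qwen2.5-Coder-14B-Instruct",
  "lora_alpha": 64,
  "lora_dropout": 0.01,
  "modules_to_save": ["lm_head"],
  "peft_type": "LORA",
  "r": 32,
  "target_modules": ["v_proj", "down_proj", "q_proj", "o_proj",
                     "k_proj", "up_proj", "gate_proj"],
  "trainable_token_indices": {"embed_tokens": [151665, 151666, 8726, 66040]},
  "use_rslora": false
}
"""


def summarize_lora_config(raw_json: str) -> dict:
    """Hypothetical helper: derive a few headline numbers from a LoRA config."""
    cfg = json.loads(raw_json)
    r, alpha = cfg["r"], cfg["lora_alpha"]
    # Standard LoRA scales the update by alpha / r; rank-stabilized LoRA
    # (use_rslora=true) uses alpha / sqrt(r) instead.
    scaling = alpha / math.sqrt(r) if cfg.get("use_rslora") else alpha / r
    token_ids = cfg.get("trainable_token_indices", {}).get("embed_tokens", [])
    return {
        "base_model": cfg["base_model_name_or_path"],
        "rank": r,
        "scaling": scaling,
        "target_modules": sorted(cfg["target_modules"]),
        "fully_trained_modules": cfg.get("modules_to_save") or [],
        "num_trainable_token_rows": len(token_ids),
    }


summary = summarize_lora_config(CONFIG_EXCERPT)
for key, value in summary.items():
    print(f"{key}: {value}")
```

Pointed at the full file instead of the excerpt, the same function would count every entry of `embed_tokens`, including the handful of lower-numbered IDs mixed into the otherwise contiguous range.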
adapter_model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f93bcf98e6aa419e3545233a436bcdb3caa8b386efd82669d63f8fc9aabc3dfb
+ size 6791812448
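
The three lines above are a Git LFS pointer rather than the weights themselves: they record the spec version, the SHA-256 digest of the real file, and its size in bytes. As a minimal sketch (the function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the path passed to `matches_pointer` is wherever a downloaded copy happens to live), a pointer can be parsed and a local file checked against it like this:

```python
import hashlib

POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:f93bcf98e6aa419e3545233a436bcdb3caa8b386efd82669d63f8fc9aabc3dfb
size 6791812448
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line.strip())
    algorithm, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest in `pointer`."""
    hasher = hashlib.new(pointer["algorithm"])
    total = 0
    with open(path, "rb") as handle:
        # Read in chunks so multi-gigabyte files are never held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
print(f"{pointer['size'] / 2**30:.2f} GiB, {pointer['algorithm']} digest")
```

The recorded size works out to a little over 6 GiB, so hashing in fixed-size chunks is the safer choice when checking a downloaded `adapter_model.safetensors`.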