Samuael committed
Commit 167a244 · verified · 1 Parent(s): e907d90

Samuael/amBART_1000

README.md CHANGED
@@ -1,201 +1,112 @@
  ---
- library_name: transformers
- tags: []
+ base_model: Samuael/amBART_1000
+ tags:
+ - generated_from_trainer
+ metrics:
+ - wer
+ - bleu
+ model-index:
+ - name: amBART_261
+   results: []
  ---

- # Model Card for Model ID
-
- <!-- Provide a quick summary of what the model is/does. -->
-
-
-
- ## Model Details
-
- ### Model Description
-
- <!-- Provide a longer summary of what this model is. -->
-
- This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
-
- - **Developed by:** [More Information Needed]
- - **Funded by [optional]:** [More Information Needed]
- - **Shared by [optional]:** [More Information Needed]
- - **Model type:** [More Information Needed]
- - **Language(s) (NLP):** [More Information Needed]
- - **License:** [More Information Needed]
- - **Finetuned from model [optional]:** [More Information Needed]
-
- ### Model Sources [optional]
-
- <!-- Provide the basic links for the model. -->
-
- - **Repository:** [More Information Needed]
- - **Paper [optional]:** [More Information Needed]
- - **Demo [optional]:** [More Information Needed]
-
- ## Uses
-
- <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
-
- ### Direct Use
-
- <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
-
- [More Information Needed]
-
- ### Downstream Use [optional]
-
- <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
-
- [More Information Needed]
-
- ### Out-of-Scope Use
-
- <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
-
- [More Information Needed]
-
- ## Bias, Risks, and Limitations
-
- <!-- This section is meant to convey both technical and sociotechnical limitations. -->
-
- [More Information Needed]
-
- ### Recommendations
-
- <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
-
- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
-
- ## How to Get Started with the Model
-
- Use the code below to get started with the model.
-
- [More Information Needed]
-
- ## Training Details
-
- ### Training Data
-
- <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
-
- [More Information Needed]
-
- ### Training Procedure
-
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
-
- #### Preprocessing [optional]
-
- [More Information Needed]
-
-
- #### Training Hyperparameters
-
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
-
- #### Speeds, Sizes, Times [optional]
-
- <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
-
- [More Information Needed]
-
- ## Evaluation
-
- <!-- This section describes the evaluation protocols and provides the results. -->
-
- ### Testing Data, Factors & Metrics
-
- #### Testing Data
-
- <!-- This should link to a Dataset Card if possible. -->
-
- [More Information Needed]
-
- #### Factors
-
- <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
-
- [More Information Needed]
-
- #### Metrics
-
- <!-- These are the evaluation metrics being used, ideally with a description of why. -->
-
- [More Information Needed]
-
- ### Results
-
- [More Information Needed]
-
- #### Summary
-
-
-
- ## Model Examination [optional]
-
- <!-- Relevant interpretability work for the model goes here -->
-
- [More Information Needed]
-
- ## Environmental Impact
-
- <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
-
- Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
-
- - **Hardware Type:** [More Information Needed]
- - **Hours used:** [More Information Needed]
- - **Cloud Provider:** [More Information Needed]
- - **Compute Region:** [More Information Needed]
- - **Carbon Emitted:** [More Information Needed]
-
- ## Technical Specifications [optional]
-
- ### Model Architecture and Objective
-
- [More Information Needed]
-
- ### Compute Infrastructure
-
- [More Information Needed]
-
- #### Hardware
-
- [More Information Needed]
-
- #### Software
-
- [More Information Needed]
-
- ## Citation [optional]
-
- <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
-
- **BibTeX:**
-
- [More Information Needed]
-
- **APA:**
-
- [More Information Needed]
-
- ## Glossary [optional]
-
- <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
-
- [More Information Needed]
-
- ## More Information [optional]
-
- [More Information Needed]
-
- ## Model Card Authors [optional]
-
- [More Information Needed]
-
- ## Model Card Contact
-
- [More Information Needed]
-
-
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # amBART_261
+
+ This model is a fine-tuned version of [Samuael/amBART_1000](https://huggingface.co/Samuael/amBART_1000) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 1.9604
+ - Wer: 2.7857
+ - Cer: 3.6889
+ - Bleu: 0.0
+ - Lr: 0.02
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 0.02
+ - train_batch_size: 1
+ - eval_batch_size: 1
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 50
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Wer | Cer | Bleu | Lr |
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:------:|:----:|
+ | No log | 1.0 | 1 | 3.9328 | 1.0 | 4.6333 | 0.0 | 0.02 |
+ | No log | 2.0 | 2 | 4.1008 | 1.0 | 6.1778 | 0.0 | 0.02 |
+ | No log | 3.0 | 3 | 3.8971 | 1.0714 | 3.7556 | 0.0 | 0.02 |
+ | No log | 4.0 | 4 | 3.5169 | 1.5714 | 6.2889 | 0.0 | 0.02 |
+ | No log | 5.0 | 5 | 3.4597 | 10.0714 | 6.1889 | 0.0 | 0.02 |
+ | No log | 6.0 | 6 | 3.4714 | 1.0 | 6.3222 | 0.0 | 0.02 |
+ | No log | 7.0 | 7 | 3.1601 | 1.0 | 6.0667 | 0.0 | 0.02 |
+ | No log | 8.0 | 8 | 2.5631 | 1.0 | 0.7667 | 0.0 | 0.02 |
+ | No log | 9.0 | 9 | 2.6357 | 2.0 | 6.3667 | 0.0 | 0.02 |
+ | No log | 10.0 | 10 | 3.1707 | 2.3571 | 6.5111 | 0.0 | 0.02 |
+ | No log | 11.0 | 11 | 2.9462 | 1.1429 | 0.7 | 0.0 | 0.02 |
+ | No log | 12.0 | 12 | 3.0437 | 1.0 | 6.2111 | 0.0 | 0.02 |
+ | No log | 13.0 | 13 | 2.6371 | 19.2143 | 8.8667 | 0.0 | 0.02 |
+ | No log | 14.0 | 14 | 2.4126 | 7.7143 | 7.1 | 0.0 | 0.02 |
+ | No log | 15.0 | 15 | 2.6156 | 19.1429 | 6.1 | 0.0 | 0.02 |
+ | No log | 16.0 | 16 | 2.7927 | 19.5714 | 6.1778 | 0.0 | 0.02 |
+ | No log | 17.0 | 17 | 2.6685 | 1.0 | 3.3333 | 0.0 | 0.02 |
+ | No log | 18.0 | 18 | 2.9460 | 1.0 | 0.8111 | 0.0 | 0.02 |
+ | No log | 19.0 | 19 | 3.3183 | 1.0714 | 3.4556 | 0.0 | 0.02 |
+ | No log | 20.0 | 20 | 3.7492 | 1.2143 | 3.5222 | 0.0 | 0.02 |
+ | No log | 21.0 | 21 | 3.8371 | 9.1429 | 6.6111 | 0.0 | 0.02 |
+ | No log | 22.0 | 22 | 3.7951 | 13.9286 | 6.3333 | 0.0 | 0.02 |
+ | No log | 23.0 | 23 | 3.4253 | 12.0714 | 6.1556 | 0.0 | 0.02 |
+ | No log | 24.0 | 24 | 3.4148 | 1.0714 | 0.7333 | 0.0 | 0.02 |
+ | No log | 25.0 | 25 | 3.0110 | 8.7143 | 5.9889 | 0.2910 | 0.02 |
+ | No log | 26.0 | 26 | 2.7432 | 1.0 | 1.1444 | 0.0 | 0.02 |
+ | No log | 27.0 | 27 | 2.5661 | 1.4286 | 0.9333 | 0.0 | 0.02 |
+ | No log | 28.0 | 28 | 2.6703 | 1.0 | 3.4889 | 0.0 | 0.02 |
+ | No log | 29.0 | 29 | 2.9169 | 18.7143 | 6.1111 | 0.0 | 0.02 |
+ | No log | 30.0 | 30 | 3.1300 | 4.0 | 4.3667 | 0.0 | 0.02 |
+ | No log | 31.0 | 31 | 3.2927 | 6.0 | 5.6222 | 0.0 | 0.02 |
+ | No log | 32.0 | 32 | 3.0442 | 6.5714 | 6.0444 | 0.0 | 0.02 |
+ | No log | 33.0 | 33 | 2.7768 | 1.7143 | 3.5222 | 0.0 | 0.02 |
+ | No log | 34.0 | 34 | 2.6387 | 1.2857 | 3.4778 | 0.0 | 0.02 |
+ | No log | 35.0 | 35 | 2.4790 | 1.2143 | 3.4444 | 0.0 | 0.02 |
+ | No log | 36.0 | 36 | 2.3595 | 5.9286 | 4.8111 | 0.0 | 0.02 |
+ | No log | 37.0 | 37 | 2.2934 | 7.6429 | 5.3 | 0.0 | 0.02 |
+ | No log | 38.0 | 38 | 2.2778 | 1.6429 | 3.7556 | 1.6467 | 0.02 |
+ | No log | 39.0 | 39 | 2.2839 | 6.0714 | 4.7333 | 0.0 | 0.02 |
+ | No log | 40.0 | 40 | 2.2559 | 1.2857 | 0.8111 | 0.0 | 0.02 |
+ | No log | 41.0 | 41 | 2.2032 | 2.5714 | 4.2333 | 0.0 | 0.02 |
+ | No log | 42.0 | 42 | 2.1507 | 1.1429 | 3.4444 | 0.0 | 0.02 |
+ | No log | 43.0 | 43 | 2.1281 | 1.0 | 0.7556 | 0.0 | 0.02 |
+ | No log | 44.0 | 44 | 2.1175 | 1.5714 | 3.4556 | 0.0 | 0.02 |
+ | No log | 45.0 | 45 | 2.0781 | 4.5714 | 4.3444 | 0.5569 | 0.02 |
+ | No log | 46.0 | 46 | 2.0383 | 1.4286 | 3.3889 | 1.8161 | 0.02 |
+ | No log | 47.0 | 47 | 2.0069 | 1.4286 | 3.3889 | 1.8161 | 0.02 |
+ | No log | 48.0 | 48 | 1.9878 | 1.3571 | 3.3667 | 0.0 | 0.02 |
+ | No log | 49.0 | 49 | 1.9714 | 3.6429 | 3.9556 | 0.0 | 0.02 |
+ | No log | 50.0 | 50 | 1.9604 | 2.7857 | 3.6889 | 0.0 | 0.02 |
+
+
+ ### Framework versions
+
+ - Transformers 4.38.2
+ - Pytorch 2.1.0+cu121
+ - Datasets 2.18.0
+ - Tokenizers 0.15.2
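
The regenerated card drops the template's "How to Get Started" snippet. A minimal loading sketch, assuming the repo id shown in this commit (`Samuael/amBART_1000`) and the `MBartForConditionalGeneration` architecture recorded in config.json below; the input string is a placeholder, not an example from the card:

```python
# Minimal sketch, not code from this commit: load the checkpoint this commit
# updates and generate with the defaults stored in generation_config.json.
from transformers import AutoTokenizer, MBartForConditionalGeneration

repo_id = "Samuael/amBART_1000"  # assumption: the repo this commit targets
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = MBartForConditionalGeneration.from_pretrained(repo_id)

inputs = tokenizer("ሰላም ለዓለም", return_tensors="pt")  # placeholder Amharic input
output_ids = model.generate(**inputs)  # max_length=300 etc. from generation_config.json
print(tokenizer.batch_decode(output_ids, skip_special_tokens=True))
```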
config.json CHANGED
@@ -1,8 +1,9 @@
  {
+ "_name_or_path": "Samuael/amBART_1000",
  "activation_dropout": 0.05,
  "activation_function": "gelu",
  "architectures": [
- "MBartModel"
+ "MBartForConditionalGeneration"
  ],
  "attention_dropout": 0.05,
  "bos_token_id": 0,
@@ -30,5 +31,5 @@
  "torch_dtype": "float32",
  "transformers_version": "4.38.2",
  "use_cache": true,
- "vocab_size": 261
+ "vocab_size": 1027
  }
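
Two substantive changes are recorded here: the checkpoint is now saved with the full seq2seq head (`MBartForConditionalGeneration` instead of the bare `MBartModel`), and the vocabulary grows from 261 to 1027 entries. When reproducing a vocabulary change like this, the embedding matrices must be resized to match the tokenizer. A hedged sketch of the usual recipe; the commit records only the result, not the script that produced it:

```python
# Sketch: keep config.vocab_size in sync with a grown tokenizer.
from transformers import MBartForConditionalGeneration, MBartTokenizer

model = MBartForConditionalGeneration.from_pretrained("Samuael/amBART_1000")
tokenizer = MBartTokenizer.from_pretrained("Samuael/amBART_1000")

# resize_token_embeddings grows (or shrinks) the input and output embeddings
# and updates model.config.vocab_size to the new size.
model.resize_token_embeddings(len(tokenizer))
assert model.config.vocab_size == len(tokenizer)
```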
generation_config.json ADDED
@@ -0,0 +1,8 @@
+ {
+ "bos_token_id": 0,
+ "eos_token_id": 2,
+ "forced_eos_token_id": 2,
+ "max_length": 300,
+ "pad_token_id": 1,
+ "transformers_version": "4.38.2"
+ }
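
The new generation_config.json carries decoding defaults that `generate()` picks up automatically. A small sketch to inspect them, assuming the same repo id:

```python
# Sketch: inspect the decoding defaults added in this commit.
from transformers import GenerationConfig

gen_cfg = GenerationConfig.from_pretrained("Samuael/amBART_1000")
print(gen_cfg.max_length)           # 300
print(gen_cfg.forced_eos_token_id)  # 2, i.e. </s> is forced as the final token
```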
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2ac1304aaf5565eaa432fa2e9633e0507268a96652b5160badadeeddbe05b928
- size 179238672
+ oid sha256:ac809e866f9edeef888d34b980db5e54540901b296a8d3a00889fbc0694db214
+ size 180813204
sentencepiece.bpe.model CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:a801c63cf0822cc3a880177fd5895196337d7e3813edde88c428061c263354a4
- size 240461
+ oid sha256:f4e4c09bd68c20916dfa8472b3c979527234d7b22afa71e1ea0bb36ee79a1bbd
+ size 253571
special_tokens_map.json CHANGED
@@ -1,12 +1,71 @@
  {
  "additional_special_tokens": [
- "",
- "ar_AR"
+ "ar_AR",
+ "cs_CZ",
+ "de_DE",
+ "en_XX",
+ "es_XX",
+ "et_EE",
+ "fi_FI",
+ "fr_XX",
+ "gu_IN",
+ "hi_IN",
+ "it_IT",
+ "ja_XX",
+ "kk_KZ",
+ "ko_KR",
+ "lt_LT",
+ "lv_LV",
+ "my_MM",
+ "ne_NP",
+ "nl_XX",
+ "ro_RO",
+ "ru_RU",
+ "si_LK",
+ "tr_TR",
+ "vi_VN",
+ "zh_CN"
  ],
- "bos_token": "<s>",
- "cls_token": "<s>",
- "eos_token": "</s>",
- "pad_token": "<pad>",
- "sep_token": "</s>",
- "unk_token": "<unk>"
+ "bos_token": {
+ "content": "<s>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false
+ },
+ "cls_token": {
+ "content": "<s>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false
+ },
+ "eos_token": {
+ "content": "</s>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false
+ },
+ "pad_token": {
+ "content": "<pad>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false
+ },
+ "sep_token": {
+ "content": "</s>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false
+ },
+ "unk_token": {
+ "content": "<unk>",
+ "lstrip": false,
+ "normalized": false,
+ "rstrip": false,
+ "single_word": false
+ }
  }
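
The map now lists all 25 mBART-style language codes as additional special tokens (previously only an empty string and `ar_AR`) and stores each core token as a full attribute dict. With `MBartTokenizer`, source and target codes are selected at load time; a sketch using the defaults recorded in tokenizer_config.json below:

```python
# Sketch: select source/target language codes; ar_AR/cs_CZ are the defaults
# recorded in this repo's tokenizer_config.json.
from transformers import MBartTokenizer

tok = MBartTokenizer.from_pretrained(
    "Samuael/amBART_1000", src_lang="ar_AR", tgt_lang="cs_CZ"
)
ids = tok("example input").input_ids
print(tok.convert_ids_to_tokens(ids))  # mBART appends </s> plus the src_lang code
```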
tokenizer_config.json CHANGED
@@ -32,7 +32,7 @@
  "single_word": false,
  "special": true
  },
- "235": {
+ "1001": {
  "content": "ar_AR",
  "lstrip": false,
  "normalized": false,
@@ -40,7 +40,7 @@
  "single_word": false,
  "special": true
  },
- "236": {
+ "1002": {
  "content": "cs_CZ",
  "lstrip": false,
  "normalized": false,
@@ -48,7 +48,7 @@
  "single_word": false,
  "special": true
  },
- "237": {
+ "1003": {
  "content": "de_DE",
  "lstrip": false,
  "normalized": false,
@@ -56,7 +56,7 @@
  "single_word": false,
  "special": true
  },
- "238": {
+ "1004": {
  "content": "en_XX",
  "lstrip": false,
  "normalized": false,
@@ -64,7 +64,7 @@
  "single_word": false,
  "special": true
  },
- "239": {
+ "1005": {
  "content": "es_XX",
  "lstrip": false,
  "normalized": false,
@@ -72,7 +72,7 @@
  "single_word": false,
  "special": true
  },
- "240": {
+ "1006": {
  "content": "et_EE",
  "lstrip": false,
  "normalized": false,
@@ -80,7 +80,7 @@
  "single_word": false,
  "special": true
  },
- "241": {
+ "1007": {
  "content": "fi_FI",
  "lstrip": false,
  "normalized": false,
@@ -88,7 +88,7 @@
  "single_word": false,
  "special": true
  },
- "242": {
+ "1008": {
  "content": "fr_XX",
  "lstrip": false,
  "normalized": false,
@@ -96,7 +96,7 @@
  "single_word": false,
  "special": true
  },
- "243": {
+ "1009": {
  "content": "gu_IN",
  "lstrip": false,
  "normalized": false,
@@ -104,7 +104,7 @@
  "single_word": false,
  "special": true
  },
- "244": {
+ "1010": {
  "content": "hi_IN",
  "lstrip": false,
  "normalized": false,
@@ -112,7 +112,7 @@
  "single_word": false,
  "special": true
  },
- "245": {
+ "1011": {
  "content": "it_IT",
  "lstrip": false,
  "normalized": false,
@@ -120,7 +120,7 @@
  "single_word": false,
  "special": true
  },
- "246": {
+ "1012": {
  "content": "ja_XX",
  "lstrip": false,
  "normalized": false,
@@ -128,7 +128,7 @@
  "single_word": false,
  "special": true
  },
- "247": {
+ "1013": {
  "content": "kk_KZ",
  "lstrip": false,
  "normalized": false,
@@ -136,7 +136,7 @@
  "single_word": false,
  "special": true
  },
- "248": {
+ "1014": {
  "content": "ko_KR",
  "lstrip": false,
  "normalized": false,
@@ -144,7 +144,7 @@
  "single_word": false,
  "special": true
  },
- "249": {
+ "1015": {
  "content": "lt_LT",
  "lstrip": false,
  "normalized": false,
@@ -152,7 +152,7 @@
  "single_word": false,
  "special": true
  },
- "250": {
+ "1016": {
  "content": "lv_LV",
  "lstrip": false,
  "normalized": false,
@@ -160,7 +160,7 @@
  "single_word": false,
  "special": true
  },
- "251": {
+ "1017": {
  "content": "my_MM",
  "lstrip": false,
  "normalized": false,
@@ -168,7 +168,7 @@
  "single_word": false,
  "special": true
  },
- "252": {
+ "1018": {
  "content": "ne_NP",
  "lstrip": false,
  "normalized": false,
@@ -176,7 +176,7 @@
  "single_word": false,
  "special": true
  },
- "253": {
+ "1019": {
  "content": "nl_XX",
  "lstrip": false,
  "normalized": false,
@@ -184,7 +184,7 @@
  "single_word": false,
  "special": true
  },
- "254": {
+ "1020": {
  "content": "ro_RO",
  "lstrip": false,
  "normalized": false,
@@ -192,7 +192,7 @@
  "single_word": false,
  "special": true
  },
- "255": {
+ "1021": {
  "content": "ru_RU",
  "lstrip": false,
  "normalized": false,
@@ -200,7 +200,7 @@
  "single_word": false,
  "special": true
  },
- "256": {
+ "1022": {
  "content": "si_LK",
  "lstrip": false,
  "normalized": false,
@@ -208,7 +208,7 @@
  "single_word": false,
  "special": true
  },
- "257": {
+ "1023": {
  "content": "tr_TR",
  "lstrip": false,
  "normalized": false,
@@ -216,7 +216,7 @@
  "single_word": false,
  "special": true
  },
- "258": {
+ "1024": {
  "content": "vi_VN",
  "lstrip": false,
  "normalized": false,
@@ -224,7 +224,7 @@
  "single_word": false,
  "special": true
  },
- "259": {
+ "1025": {
  "content": "zh_CN",
  "lstrip": false,
  "normalized": false,
@@ -234,8 +234,31 @@
  }
  },
  "additional_special_tokens": [
- "",
- "ar_AR"
+ "ar_AR",
+ "cs_CZ",
+ "de_DE",
+ "en_XX",
+ "es_XX",
+ "et_EE",
+ "fi_FI",
+ "fr_XX",
+ "gu_IN",
+ "hi_IN",
+ "it_IT",
+ "ja_XX",
+ "kk_KZ",
+ "ko_KR",
+ "lt_LT",
+ "lv_LV",
+ "my_MM",
+ "ne_NP",
+ "nl_XX",
+ "ro_RO",
+ "ru_RU",
+ "si_LK",
+ "tr_TR",
+ "vi_VN",
+ "zh_CN"
  ],
  "bos_token": "<s>",
  "clean_up_tokenization_spaces": true,
@@ -249,6 +272,5 @@
  "src_lang": "ar_AR",
  "tgt_lang": "cs_CZ",
  "tokenizer_class": "MBartTokenizer",
- "tokenizer_file": null,
  "unk_token": "<unk>"
  }
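
The semantic changes in this file are the remapped ids for the language-code tokens (235–259 become 1001–1025, consistent with the SentencePiece vocabulary growing to roughly 1000 base pieces), the expanded `additional_special_tokens` list, and the dropped `"tokenizer_file": null` entry. A quick hedged check of the remapping:

```python
# Sketch: confirm the language-code ids recorded in this commit (235 -> 1001, etc.).
from transformers import MBartTokenizer

tok = MBartTokenizer.from_pretrained("Samuael/amBART_1000")
print(tok.convert_tokens_to_ids("ar_AR"))  # expected: 1001
print(tok.convert_tokens_to_ids("zh_CN"))  # expected: 1025
```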
training_args.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a2445b6d5f5bc6057cc0c1ec874b25e6b98a995e1b89e258fddb4cab31087bb5
+ size 4984