DerivedFunction commited on
Commit
41477bd
·
verified ·
1 Parent(s): c8f45bd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -22
README.md CHANGED
@@ -130,28 +130,30 @@ The model supports the following ISO-coded languages:
130
  The coverage is as follows from a sample:
131
 
132
  Per-group coverage (examples / tokens):
133
- | English | 47 examples | 3947 tokens
134
- | Russian | 47 examples | 3665 tokens
135
- | German | 58 examples | 4625 tokens
136
- | Japanese | 50 examples | 4188 tokens
137
- | Chinese | 60 examples | 4131 tokens
138
- | French | 40 examples | 3723 tokens
139
- | Spanish | 44 examples | 4756 tokens
140
- | Portuguese | 27 examples | 2130 tokens
141
- | Italian | 57 examples | 5178 tokens
142
- | Polish | 25 examples | 1753 tokens
143
- | Dutch | | 35 examples | 2315 tokens
144
- | SoutheastAsianLatin | 114 examples | 8861 tokens
145
- | CentralEuropeanLatin | 125 examples | 9761 tokens
146
- | Korean | 38 examples | 3958 tokens
147
- | EastSlavicCyrillic | 85 examples | 7471 tokens
148
- | Arabic | 45 examples | 2508 tokens
149
- | BalkanCyrillic | 71 examples | 6231 tokens
150
- | | Hindi | 33 examples | 3251 tokens
151
- | IndicOther | 261 examples | 40630 tokens
152
- | CentralAsianCyrillic | 57 examples | 3789 tokens
153
- | AfricanLatin | 82 examples | 5910 tokens
154
- | OtherScripts | 269 examples | 28603 tokens
 
 
155
 
156
  Top token languages:
157
  ml 8197
 
130
  The coverage is as follows from a sample:
131
 
132
  Per-group coverage (examples / tokens):
133
+ | language | examples | tokens |
134
+ | --- | -- | -- |
135
+ | English | 47 examples | 3947 tokens |
136
+ | Russian | 47 examples | 3665 tokens |
137
+ | German | 58 examples | 4625 tokens |
138
+ | Japanese | 50 examples | 4188 tokens |
139
+ | Chinese | 60 examples | 4131 tokens |
140
+ | French | 40 examples | 3723 tokens |
141
+ | Spanish | 44 examples | 4756 tokens |
142
+ | Portuguese | 27 examples | 2130 tokens |
143
+ | Italian | 57 examples | 5178 tokens |
144
+ | Polish | 25 examples | 1753 tokens |
145
+ | Dutch | | 35 examples | 2315 tokens |
146
+ | SoutheastAsianLatin | 114 examples | 8861 |
147
+ | CentralEuropeanLatin | 125 examples | 9761 tokens |
148
+ | Korean | 38 examples | 3958 tokens |
149
+ | EastSlavicCyrillic | 85 examples | 7471 tokens |
150
+ | Arabic | 45 examples | 2508 tokens |
151
+ | BalkanCyrillic | 71 examples | 6231 tokens |
152
+ | | Hindi | 33 examples | 3251 tokens |
153
+ | IndicOther | 261 examples | 40630 tokens |
154
+ | CentralAsianCyrillic | 57 examples | 3789 tokens |
155
+ | AfricanLatin | 82 examples | 5910 tokens |
156
+ | OtherScripts | 269 examples | 28603 tokens |
157
 
158
  Top token languages:
159
  ml 8197