Jr23xd23 commited on
Commit
4e286bc
·
verified ·
1 Parent(s): a49e8c4

Upload tokenizer_fertility.json with huggingface_hub

Browse files
Files changed (1) hide show
  1. tokenizer_fertility.json +10 -0
tokenizer_fertility.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "words": 368,
3
+ "ours_tokens": 664,
4
+ "ours_fertility": 1.8043478260869565,
5
+ "baseline_tokens": 803,
6
+ "baseline_fertility": 2.182065217391304,
7
+ "reduction_pct": 17.310087173100865,
8
+ "ours_vocab_size": 178697,
9
+ "baseline_vocab_size": 151665
10
+ }