gsaltintas commited on
Commit
078eba3
·
verified ·
1 Parent(s): b579c82

Upload mod_tokenizers/flexitok--mod-tokenizers-rtl_2digit_overlap.json with huggingface_hub

Browse files
mod_tokenizers/flexitok--mod-tokenizers-rtl_2digit_overlap.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"8": {"ratio_to_total_tokens": 0.8411214953271028, "expected_training_ratio_in_superset": 0.7476635514018691, "num_tokens": 90}, "9": {"ratio_to_total_tokens": 0.1588785046728972, "expected_training_ratio_in_superset": 0.1588785046728972, "num_tokens": 17}, "total_training_compared_to_full_model": 0.9065420560747663}