RightNow-Arabic-0.5B-Turbo / tokenizer_fertility.json
Jr23xd23's picture
Upload tokenizer_fertility.json with huggingface_hub
4e286bc verified
{
"words": 368,
"ours_tokens": 664,
"ours_fertility": 1.8043478260869565,
"baseline_tokens": 803,
"baseline_fertility": 2.182065217391304,
"reduction_pct": 17.310087173100865,
"ours_vocab_size": 178697,
"baseline_vocab_size": 151665
}