Upload lexicons/lexicon_en.txt with huggingface_hub eef9b5a verified almaghrabima commited on 2 days ago
Upload lexicons/lexicon_ar.txt with huggingface_hub 453d9e0 verified almaghrabima commited on 2 days ago
Upload morfessor_models/morf_map_reverse.json with huggingface_hub 2772bff verified almaghrabima commited on 2 days ago
Upload morfessor_models/morf_map.json with huggingface_hub 5fee9e3 verified almaghrabima commited on 2 days ago
Upload morfessor_models/morfessor_en.bin with huggingface_hub 77f39b7 verified almaghrabima commited on 2 days ago
Upload morfessor_models/morfessor_ar.bin with huggingface_hub 7309795 verified almaghrabima commited on 2 days ago
Upload benchmark_pypi_full.py with huggingface_hub 4e2b74e verified almaghrabima commited on 5 days ago
Upload tokenizer_config.json with huggingface_hub dfd4850 verified almaghrabima commited on 5 days ago
Upload special_tokens_map.json with huggingface_hub f6b77da verified almaghrabima commited on 5 days ago
Update benchmark results with new tokenizers (Falcon-H1, ALLaM, Hala, Mistral) 55db6a1 verified almaghrabima commited on 6 days ago
Upload benchmark_parallel_results.json with huggingface_hub a88f8a7 verified almaghrabima commited on 6 days ago
Upload test_comprehensive_results.json with huggingface_hub 1e8911f verified almaghrabima commited on 6 days ago
Upload test_comprehensive_million.py with huggingface_hub c24518d verified almaghrabima commited on 6 days ago
Upload benchmark_tiktoken_style.py with huggingface_hub 9770614 verified almaghrabima commited on 6 days ago