Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
almaghrabima
/
SARF-Tokenizer
like
0
Arabic
English
tokenizer
arabic
morphology
benchmark
License:
cc-by-nc-4.0
Model card
Files
Files and versions
xet
Community
main
SARF-Tokenizer
/
tokenizer_benchmark.py
Commit History
Update: rank by parity+efficiency, add Falcon-H1-7B
5362025
almaghrabima
commited on
3 days ago
Add benchmark results: 13 tokenizer comparison
301b160
almaghrabima
commited on
3 days ago