Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
OiQ
/
daa-tokenizers
like
0
Follow
OiQ Labs
Arabic
Latin
tokenizer
moroccan-darija
arabic
bpe
unigram
wordpiece
bbpe
benchmark
License:
mit
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
daa-tokenizers
/
results
2.5 MB
Ctrl+K
Ctrl+K
1 contributor
History:
20 commits
Ouaill
Upload results/external_datasets_eval.json with huggingface_hub
765c3cc
verified
7 days ago
plots
Upload results/plots/dataset_comparison.png with huggingface_hub
7 days ago
atlaset_full_stats.json
485 Bytes
Upload results/atlaset_full_stats.json with huggingface_hub
7 days ago
codeswitch_results.csv
1.69 kB
Upload results/codeswitch_results.csv with huggingface_hub
7 days ago
codeswitch_results.json
3.45 kB
Upload results/codeswitch_results.json with huggingface_hub
7 days ago
dataset_stats.csv
807 Bytes
Upload results/dataset_stats.csv with huggingface_hub
7 days ago
dataset_stats.json
2.51 kB
Upload results/dataset_stats.json with huggingface_hub
7 days ago
doda_independent_results.csv
1.41 kB
Upload results/doda_independent_results.csv with huggingface_hub
7 days ago
doda_independent_results.json
3.5 kB
Upload results/doda_independent_results.json with huggingface_hub
7 days ago
external_comparison.csv
2.74 kB
Upload results/external_comparison.csv with huggingface_hub
7 days ago
external_datasets_eval.csv
1.6 kB
Upload results/external_datasets_eval.csv with huggingface_hub
7 days ago
external_datasets_eval.json
5.66 kB
Upload results/external_datasets_eval.json with huggingface_hub
7 days ago