Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
dataflare
/
df-arc
like
1
Follow
Dataflare
2
dataflare/arabic-dialect-corpus
dataflare/egypt-legal-corpus
Arabic
arabic
tokenizer
morphology
nlp
dialect
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
b892abd
df-arc
14.5 MB
Ctrl+K
Ctrl+K
1 contributor
History:
11 commits
fr3on
Release v1.1: PMI Phrase Merging & Smart Morphology
b892abd
verified
3 months ago
.gitattributes
Safe
1.57 kB
Upload folder using huggingface_hub
3 months ago
README.md
1.05 kB
Update README.md
3 months ago
exceptions.txt
979 Bytes
Release v1.1: PMI Phrase Merging & Smart Morphology
3 months ago
phrases.json
1.09 MB
Upload folder using huggingface_hub
3 months ago
tokenization_df_arc.py
11.4 kB
Release v1.1: PMI Phrase Merging & Smart Morphology
3 months ago
tokenizer.json
13.4 MB
xet
Upload folder using huggingface_hub
3 months ago
tokenizer_config.json
440 Bytes
Release v1.1: PMI Phrase Merging & Smart Morphology
3 months ago