Commit d83bb67 by fr3on (verified) · 1 parent: f308444

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -13,7 +13,7 @@ datasets:
 - dataflare/egypt-legal-corpus
 ---
 
-# DF-Arc v1.1
+# DF-Arc
 
 **DF-Arc** is a specialized Arabic tokenizer that minimizes the "Arabic Token Tax" by combining **Morphological Pre-tokenization** with **PMI-based Phrase Merging**.
 
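
The README line in this diff mentions PMI-based phrase merging without saying how it is computed. As a rough illustration of the general idea only (not DF-Arc's actual implementation), the sketch below scores adjacent token pairs by pointwise mutual information, PMI(a, b) = log2(p(a, b) / (p(a) p(b))), and joins pairs whose score meets a threshold. The function names `pmi_scores` and `merge_phrases`, the `_` joiner, the threshold value, and the toy corpus are all hypothetical.

```python
import math
from collections import Counter


def pmi_scores(sentences):
    """Return the PMI (log base 2) of every adjacent token pair in the corpus."""
    unigrams = Counter()
    bigrams = Counter()
    for tokens in sentences:
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())
    scores = {}
    for (a, b), count in bigrams.items():
        p_ab = count / n_bi          # probability of the pair among all pairs
        p_a = unigrams[a] / n_uni    # probability of each token on its own
        p_b = unigrams[b] / n_uni
        scores[(a, b)] = math.log2(p_ab / (p_a * p_b))
    return scores


def merge_phrases(tokens, scores, threshold):
    """Greedily join adjacent tokens, left to right, when their PMI >= threshold."""
    merged = []
    i = 0
    while i < len(tokens):
        pair = tuple(tokens[i:i + 2])
        if len(pair) == 2 and scores.get(pair, float("-inf")) >= threshold:
            merged.append(pair[0] + "_" + pair[1])  # hypothetical joiner
            i += 2
        else:
            merged.append(tokens[i])
            i += 1
    return merged


# Toy corpus (illustrative only; already split into tokens).
corpus = [
    ["new", "york", "is", "big"],
    ["new", "york", "is", "old"],
    ["the", "city", "is", "big"],
]
scores = pmi_scores(corpus)
print(merge_phrases(["new", "york", "is", "big"], scores, threshold=2.9))
```

In a pipeline like the one the README describes, the input tokens would presumably be the output of the morphological pre-tokenization step rather than whitespace-split words; that ordering is an assumption here, not something the diff states.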