Update README.md
Browse files
README.md
CHANGED
|
@@ -13,7 +13,7 @@ datasets:
|
|
| 13 |
- dataflare/egypt-legal-corpus
|
| 14 |
---
|
| 15 |
|
| 16 |
-
# DF-Arc
|
| 17 |
|
| 18 |
**DF-Arc** is a specialized Arabic tokenizer that minimizes the "Arabic Token Tax" by combining **Morphological Pre-tokenization** with **PMI-based Phrase Merging**.
|
| 19 |
|
|
|
|
| 13 |
- dataflare/egypt-legal-corpus
|
| 14 |
---
|
| 15 |
|
| 16 |
+
# DF-Arc
|
| 17 |
|
| 18 |
**DF-Arc** is a specialized Arabic tokenizer that minimizes the "Arabic Token Tax" by combining **Morphological Pre-tokenization** with **PMI-based Phrase Merging**.
|
| 19 |
|