Rewording.
Browse files
README.md
CHANGED
|
@@ -6,7 +6,7 @@ tags:
|
|
| 6 |
- legal
|
| 7 |
---
|
| 8 |
|
| 9 |
-
The **Kanon tokenizer** is the world's most space efficient legal document tokenizer of its size.
|
| 10 |
|
| 11 |
With a vocabulary of only 65,536 tokens, documents compressed with the tokenizer are capable of being stored as unsigned 16-bit integers, **reducing memory requirements dramatically over larger vocabularies**.
|
| 12 |
|
|
|
|
| 6 |
- legal
|
| 7 |
---
|
| 8 |
|
| 9 |
+
The **Kanon tokenizer** by [**Isaacus**](https://isaacus.com/) is the world's most space efficient legal document tokenizer of its size.
|
| 10 |
|
| 11 |
With a vocabulary of only 65,536 tokens, documents compressed with the tokenizer are capable of being stored as unsigned 16-bit integers, **reducing memory requirements dramatically over larger vocabularies**.
|
| 12 |
|