---
language:
- ko
library_name: transformers
---
Based on the Phi-3 tokenizer, with the vocabulary expanded by 17,291 tokens.
Following [The Optimal Vocabulary Size Predictor](https://huggingface.co/spaces/sail/scaling-with-vocab-demo), I recommend using this tokenizer with a 3-4B parameter model such as Phi-3-mini.
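
A minimal sketch of how the tokenizer might be paired with a base model, assuming the standard `transformers` loading API. The repository ID `your-namespace/this-tokenizer` is a hypothetical placeholder (this README does not state the repo name), `microsoft/Phi-3-mini-4k-instruct` is assumed as one possible Phi-3-mini checkpoint, and the helper `resize_embeddings_to_tokenizer` is an illustrative name, not part of any library.

```python
# Hypothetical placeholder: replace with the actual repository ID of this tokenizer.
TOKENIZER_REPO = "your-namespace/this-tokenizer"
# Assumed base checkpoint; any 3-4B model could be substituted here.
BASE_MODEL = "microsoft/Phi-3-mini-4k-instruct"


def resize_embeddings_to_tokenizer(model, tokenizer):
    """Grow the model's embedding matrix so every tokenizer ID has a row.

    Returns (old_size, new_size). The matrix is never shrunk, because some
    checkpoints pad their embeddings beyond the tokenizer's vocabulary.
    """
    old_size = model.get_input_embeddings().weight.shape[0]
    needed = len(tokenizer)  # includes added tokens
    if needed > old_size:
        model.resize_token_embeddings(needed)
        return old_size, needed
    return old_size, old_size


def main():
    # Imported lazily so the helper above can be used without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(TOKENIZER_REPO)
    model = AutoModelForCausalLM.from_pretrained(BASE_MODEL)
    old_size, new_size = resize_embeddings_to_tokenizer(model, tokenizer)
    print(f"embedding rows: {old_size} -> {new_size}")
```

Calling `main()` downloads both the tokenizer and the model. Embedding rows created by the resize are newly initialized, so the model needs further training before the added tokens produce useful output.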