metadata
language:
- ko
library_name: transformers
based on phi-3 tokenizer, expanded 17291 tokens
Following The Optimal Vocabulary Size Predictor, I recommend using this tokenizer with 3-4B model such as phi-3-mini