---
language:
  - ko
library_name: transformers
---

Based on the Phi-3 tokenizer, with the vocabulary expanded by 17291 tokens.

Following the Optimal Vocabulary Size Predictor, I recommend using this tokenizer with a 3-4B parameter model such as phi-3-mini.
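
A minimal sketch of how the tokenizer might be paired with such a model, assuming the `transformers` library. The repo id `your-namespace/this-tokenizer` is a hypothetical placeholder, `microsoft/Phi-3-mini-4k-instruct` is only an assumed choice of base model, and the helper names `rows_to_add` and `load_with_expanded_vocab` are made up for illustration.

```python
# Hypothetical repo id: replace with this tokenizer's actual Hub id.
TOKENIZER_ID = "your-namespace/this-tokenizer"
# Assumed base model; any 3-4B causal LM should work in a similar way.
MODEL_ID = "microsoft/Phi-3-mini-4k-instruct"


def rows_to_add(vocab_size: int, embedding_rows: int) -> int:
    """Number of embedding rows missing for a vocabulary of `vocab_size`."""
    return max(0, vocab_size - embedding_rows)


def load_with_expanded_vocab(tokenizer_id: str = TOKENIZER_ID,
                             model_id: str = MODEL_ID):
    """Load the tokenizer and a base model, growing the embeddings if needed."""
    # Imported lazily so rows_to_add can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(tokenizer_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    current_rows = model.get_input_embeddings().num_embeddings
    if rows_to_add(len(tokenizer), current_rows) > 0:
        # Grow the embedding matrix so every token id has a row.
        model.resize_token_embeddings(len(tokenizer))
    return tokenizer, model
```

Rows created by `resize_token_embeddings` are newly initialized, so the model would likely need further training before the added tokens are useful.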