Bingsu's picture
Create README.md
efdcfc6
---
language:
- ko
tags:
- roberta
- tokenizer only
license:
- mit
---
## ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ ๋ฒ„์ „
- transformers: 4.21.2
- datasets: 2.4.0
- tokenizers: 0.12.1
[Bingsu/ko_BBPE_tokenizer_roberta](https://huggingface.co/Bingsu/ko_BBPE_tokenizer_roberta)์™€ ๊ฐ™์€ ๋ฐฉ๋ฒ•์œผ๋กœ ํ›ˆ๋ จํ•œ ํ† ํฌ๋‚˜์ด์ €.
๋‹ค๋งŒ `unicode_normalizer="nfkc"`๋ฅผ ๋บ์Šต๋‹ˆ๋‹ค.
```python
tokenizer = ByteLevelBPETokenizer(trim_offsets=True)
```