Library versions

  • transformers: 4.21.2
  • datasets: 2.4.0
  • tokenizers: 0.12.1

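A tokenizer trained in the same way as Bingsu/ko_BBPE_tokenizer_roberta.

The only difference is that unicode_normalizer="nfkc" was omitted, so input text is not Unicode-normalized before tokenization.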

tokenizer = ByteLevelBPETokenizer(trim_offsets=True)
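
To make the effect of dropping the normalizer concrete, here is a minimal sketch that trains a tiny byte-level BPE tokenizer with the same constructor call and compares its round-trip output with what NFKC normalization would have produced. The toy corpus, `vocab_size`, `min_frequency`, and the special tokens are assumptions chosen for illustration; this card does not state the actual training data or settings.

```python
import unicodedata

from tokenizers import ByteLevelBPETokenizer

# Hypothetical toy corpus; the real training data is not given in this card.
corpus = [
    "안녕하세요, 토크나이저 예제입니다.",
    "ＡＢＣ 전각 문자와 ① 같은 기호도 포함합니다.",
    "Byte-level BPE는 모든 바이트를 기본 어휘로 가집니다.",
]

# Same constructor call as in the card: no unicode_normalizer argument.
tokenizer = ByteLevelBPETokenizer(trim_offsets=True)

# vocab_size, min_frequency and special_tokens are illustrative assumptions.
tokenizer.train_from_iterator(
    corpus,
    vocab_size=400,
    min_frequency=1,
    special_tokens=["<s>", "<pad>", "</s>", "<unk>", "<mask>"],
    show_progress=False,
)

# Full-width letters and a circled digit are changed by NFKC.
text = "ＡＢＣ ①"
encoding = tokenizer.encode(text)
decoded = tokenizer.decode(encoding.ids)

# For comparison: what the text would look like after NFKC normalization.
nfkc_text = unicodedata.normalize("NFKC", text)

print("decoded  :", decoded)
print("NFKC form:", nfkc_text)
```

Because no normalizer is attached, the decoded string keeps the full-width letters and the circled digit exactly as written, whereas the NFKC form folds them into their ASCII equivalents.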