
Utilities for Tokenizers [[utilities-for-tokenizers]]

This page lists all the utility functions used by the tokenizers. It mainly covers the [~tokenization_utils_base.PreTrainedTokenizerBase] class, which implements the methods shared by [PreTrainedTokenizer] and [PreTrainedTokenizerFast], and the [~tokenization_utils_base.SpecialTokensMixin] mixin.

Most of these are only useful if you are studying the tokenizer code in the library.

PreTrainedTokenizerBase [[transformers.PreTrainedTokenizerBase]]

[[autodoc]] tokenization_utils_base.PreTrainedTokenizerBase

  • __call__
  • all

SpecialTokensMixin [[transformers.SpecialTokensMixin]]

[[autodoc]] tokenization_utils_base.SpecialTokensMixin

Enums and namedtuples [[transformers.tokenization_utils_base.TruncationStrategy]]

[[autodoc]] tokenization_utils_base.TruncationStrategy

[[autodoc]] tokenization_utils_base.CharSpan

[[autodoc]] tokenization_utils_base.TokenSpan
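
To get a feel for the shape of the three types above, here is a minimal sketch that needs only the standard library. The classes below are stand-ins written for illustration, not imports from the library: they assume that `TruncationStrategy` is a string-valued enum and that `CharSpan` and `TokenSpan` are named tuples with `start` and `end` fields, where `end` is exclusive. The sample text and offsets are made up.

```python
from enum import Enum
from typing import NamedTuple


class TruncationStrategy(Enum):
    """Illustrative stand-in for tokenization_utils_base.TruncationStrategy."""

    ONLY_FIRST = "only_first"
    ONLY_SECOND = "only_second"
    LONGEST_FIRST = "longest_first"
    DO_NOT_TRUNCATE = "do_not_truncate"


class CharSpan(NamedTuple):
    """Illustrative stand-in: a character span in the original string."""

    start: int  # index of the first character
    end: int    # index of the character following the last one


class TokenSpan(NamedTuple):
    """Illustrative stand-in: a token span in the encoded sequence."""

    start: int  # index of the first token
    end: int    # index of the token following the last one


# A CharSpan can be used directly to slice the original text.
text = "Hello world"
span = CharSpan(start=6, end=11)
word = text[span.start:span.end]

# An enum member can be looked up from its string value.
strategy = TruncationStrategy("longest_first")

# Named tuples unpack like ordinary tuples.
first_token, past_last_token = TokenSpan(start=1, end=3)
```

Because the spans are plain named tuples, they can be unpacked, compared, and used as slice bounds without any extra helper.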