DrDavis's picture
Upload folder using huggingface_hub
17c6d62 verified
|
raw
history blame
1.77 kB

ํ† ํฌ๋‚˜์ด์ €๋ฅผ ์œ„ํ•œ ์œ ํ‹ธ๋ฆฌํ‹ฐ [[utilities-for-tokenizers]]

์ด ํŽ˜์ด์ง€๋Š” ํ† ํฌ๋‚˜์ด์ €์—์„œ ์‚ฌ์šฉ๋˜๋Š” ๋ชจ๋“  ์œ ํ‹ธ๋ฆฌํ‹ฐ ํ•จ์ˆ˜๋“ค์„ ๋‚˜์—ดํ•˜๋ฉฐ, ์ฃผ๋กœ [PreTrainedTokenizer]์™€ [PreTrainedTokenizerFast] ์‚ฌ์ด์˜ ๊ณตํ†ต ๋ฉ”์†Œ๋“œ๋ฅผ ๊ตฌํ˜„ํ•˜๋Š” [~tokenization_utils_base.PreTrainedTokenizerBase] ํด๋ž˜์Šค์™€ [~tokenization_utils_base.SpecialTokensMixin]์„ ๋‹ค๋ฃน๋‹ˆ๋‹ค.

์ด ํ•จ์ˆ˜๋“ค ๋Œ€๋ถ€๋ถ„์€ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ์˜ ํ† ํฌ๋‚˜์ด์ € ์ฝ”๋“œ๋ฅผ ์—ฐ๊ตฌํ•  ๋•Œ๋งŒ ์œ ์šฉํ•ฉ๋‹ˆ๋‹ค.

PreTrainedTokenizerBase [[transformers.PreTrainedTokenizerBase]]

[[autodoc]] tokenization_utils_base.PreTrainedTokenizerBase

  • call
  • all

SpecialTokensMixin [[transformers.SpecialTokensMixin]]

[[autodoc]] tokenization_utils_base.SpecialTokensMixin

Enums ๋ฐ namedtuples [[transformers.tokenization_utils_base.TruncationStrategy]]

[[autodoc]] tokenization_utils_base.TruncationStrategy

[[autodoc]] tokenization_utils_base.CharSpan

[[autodoc]] tokenization_utils_base.TokenSpan