
Utilities for Tokenizers [[utilities-for-tokenizers]]

This page lists all the utility functions used by the tokenizers, mainly the [~tokenization_utils_base.PreTrainedTokenizerBase] class, which implements the methods common to [PreTrainedTokenizer] and [PreTrainedTokenizerFast].

Most of these are only useful if you are studying the tokenizer code in the library.

PreTrainedTokenizerBase [[transformers.PreTrainedTokenizerBase]]

[[autodoc]] tokenization_utils_base.PreTrainedTokenizerBase

  • __call__
  • all

Enums and namedtuples [[transformers.tokenization_utils_base.TruncationStrategy]]

[[autodoc]] tokenization_utils_base.TruncationStrategy

[[autodoc]] tokenization_utils_base.CharSpan

[[autodoc]] tokenization_utils_base.TokenSpan
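
To give a feel for the three types listed above, here is a minimal standard-library sketch. The classes below are stand-ins written for illustration, not the library's source: the member values and the `start`/`end` fields are assumptions based on the public API documentation, and the half-open span convention (start inclusive, end exclusive) is likewise assumed.

```python
from enum import Enum
from typing import NamedTuple


class TruncationStrategy(Enum):
    """Stand-in enum; member values are assumed to match the library's."""
    ONLY_FIRST = "only_first"
    ONLY_SECOND = "only_second"
    LONGEST_FIRST = "longest_first"
    DO_NOT_TRUNCATE = "do_not_truncate"


class CharSpan(NamedTuple):
    """Stand-in: a character range in the original string."""
    start: int  # index of the first character (inclusive, assumed)
    end: int    # index just past the last character (exclusive, assumed)


class TokenSpan(NamedTuple):
    """Stand-in: a token range in the encoded sequence."""
    start: int
    end: int


text = "Hello world"

# A CharSpan can be used directly to slice the source text.
span = CharSpan(start=6, end=11)
word = text[span.start:span.end]

# Namedtuples unpack like ordinary tuples.
first_token, past_last_token = TokenSpan(start=1, end=3)

# An enum member can be looked up from its string value.
strategy = TruncationStrategy("longest_first")
```

In the real library these types would be imported from `transformers.tokenization_utils_base` rather than redefined. The stand-ins only show why a namedtuple suits spans (cheap, immutable, unpackable) and why an enum suits a closed set of truncation options.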