๋ฐ์ดํฐ ์ฝ๋ ์ดํฐ(Data Collator)[[data-collator]]
๋ฐ์ดํฐ ์ฝ๋ ์ดํฐ๋ ๋ฐ์ดํฐ์
์์๋ค์ ๋ฆฌ์คํธ๋ฅผ ์
๋ ฅ์ผ๋ก ์ฌ์ฉํ์ฌ ๋ฐฐ์น๋ฅผ ํ์ฑํ๋ ๊ฐ์ฒด์
๋๋ค. ์ด๋ฌํ ์์๋ค์ train_dataset ๋๋ eval_dataset์ ์์๋ค๊ณผ ๋์ผํ ํ์
์
๋๋ค. ๋ฐฐ์น๋ฅผ ๊ตฌ์ฑํ๊ธฐ ์ํด, ๋ฐ์ดํฐ ์ฝ๋ ์ดํฐ๋ (ํจ๋ฉ๊ณผ ๊ฐ์) ์ผ๋ถ ์ฒ๋ฆฌ๋ฅผ ์ ์ฉํ ์ ์์ต๋๋ค. [DataCollatorForLanguageModeling]๊ณผ ๊ฐ์ ์ผ๋ถ ์ฝ๋ ์ดํฐ๋ ํ์ฑ๋ ๋ฐฐ์น์ (๋ฌด์์ ๋ง์คํน๊ณผ ๊ฐ์) ์ผ๋ถ ๋ฌด์์ ๋ฐ์ดํฐ ์ฆ๊ฐ๋ ์ ์ฉํฉ๋๋ค. ์ฌ์ฉ ์์๋ ์์ ์คํฌ๋ฆฝํธ๋ ์์ ๋
ธํธ๋ถ์์ ์ฐพ์ ์ ์์ต๋๋ค.
๊ธฐ๋ณธ ๋ฐ์ดํฐ ์ฝ๋ ์ดํฐ[[transformers.default_data_collator]]
[[autodoc]] data.data_collator.default_data_collator
DefaultDataCollator[[transformers.DefaultDataCollator]]
[[autodoc]] data.data_collator.DefaultDataCollator
DataCollatorWithPadding[[transformers.DataCollatorWithPadding]]
[[autodoc]] data.data_collator.DataCollatorWithPadding
DataCollatorForTokenClassification[[transformers.DataCollatorForTokenClassification]]
[[autodoc]] data.data_collator.DataCollatorForTokenClassification
DataCollatorForSeq2Seq[[transformers.DataCollatorForSeq2Seq]]
[[autodoc]] data.data_collator.DataCollatorForSeq2Seq
DataCollatorForLanguageModeling[[transformers.DataCollatorForLanguageModeling]]
[[autodoc]] data.data_collator.DataCollatorForLanguageModeling - numpy_mask_tokens - tf_mask_tokens - torch_mask_tokens
DataCollatorForWholeWordMask[[transformers.DataCollatorForWholeWordMask]]
[[autodoc]] data.data_collator.DataCollatorForWholeWordMask - numpy_mask_tokens - tf_mask_tokens - torch_mask_tokens
DataCollatorForPermutationLanguageModeling[[transformers.DataCollatorForPermutationLanguageModeling]]
[[autodoc]] data.data_collator.DataCollatorForPermutationLanguageModeling - numpy_mask_tokens - tf_mask_tokens - torch_mask_tokens
DataCollatorWithFlatteningtransformers.DataCollatorWithFlattening
[[autodoc]] data.data_collator.DataCollatorWithFlattening