ghrua's picture
Initial commit with Dockerfile
8b821fa

Auto ํด๋ž˜์Šค[[auto-classes]]

๋งŽ์€ ๊ฒฝ์šฐ, ์‚ฌ์šฉํ•˜๋ ค๋Š” ์•„ํ‚คํ…์ฒ˜๋Š” from_pretrained() ๋ฉ”์†Œ๋“œ์—์„œ ์ œ๊ณตํ•˜๋Š” ์‚ฌ์ „ ํ›ˆ๋ จ๋œ ๋ชจ๋ธ์˜ ์ด๋ฆ„์ด๋‚˜ ๊ฒฝ๋กœ๋กœ๋ถ€ํ„ฐ ์œ ์ถ”ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. AutoClasses๋Š” ์ด ์ž‘์—…์„ ์œ„ํ•ด ์กด์žฌํ•˜๋ฉฐ, ์‚ฌ์ „ ํ•™์Šต๋œ ๋ชจ๋ธ ๊ฐ€์ค‘์น˜/๊ตฌ์„ฑ/๋‹จ์–ด์‚ฌ์ „์— ๋Œ€ํ•œ ์ด๋ฆ„/๊ฒฝ๋กœ๋ฅผ ์ œ๊ณตํ•˜๋ฉด ์ž๋™์œผ๋กœ ๊ด€๋ จ ๋ชจ๋ธ์„ ๊ฐ€์ ธ์˜ค๋„๋ก ๋„์™€์ค๋‹ˆ๋‹ค.

[AutoConfig], [AutoModel], [AutoTokenizer] ์ค‘ ํ•˜๋‚˜๋ฅผ ์ธ์Šคํ„ด์Šคํ™”ํ•˜๋ฉด ํ•ด๋‹น ์•„ํ‚คํ…์ฒ˜์˜ ํด๋ž˜์Šค๋ฅผ ์ง์ ‘ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด,

model = AutoModel.from_pretrained("google-bert/bert-base-cased")

์œ„ ์ฝ”๋“œ๋Š” [BertModel]์˜ ์ธ์Šคํ„ด์Šค์ธ ๋ชจ๋ธ์„ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.

๊ฐ ์ž‘์—…์— ๋Œ€ํ•ด ํ•˜๋‚˜์˜ AutoModel ํด๋ž˜์Šค๊ฐ€ ์žˆ์œผ๋ฉฐ, ๊ฐ๊ฐ์˜ ๋ฐฑ์—”๋“œ(PyTorch, TensorFlow ๋˜๋Š” Flax)์— ํ•ด๋‹นํ•˜๋Š” ํด๋ž˜์Šค๊ฐ€ ์กด์žฌํ•ฉ๋‹ˆ๋‹ค.

์ž๋™ ํด๋ž˜์Šค ํ™•์žฅ[[extending-the-auto-classes]]

๊ฐ ์ž๋™ ํด๋ž˜์Šค๋Š” ์‚ฌ์šฉ์ž์˜ ์ปค์Šคํ…€ ํด๋ž˜์Šค๋กœ ํ™•์žฅ๋  ์ˆ˜ ์žˆ๋Š” ๋ฉ”์†Œ๋“œ๋ฅผ ๊ฐ€์ง€๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด, NewModel์ด๋ผ๋Š” ์ปค์Šคํ…€ ๋ชจ๋ธ ํด๋ž˜์Šค๋ฅผ ์ •์˜ํ–ˆ๋‹ค๋ฉด, NewModelConfig๋ฅผ ์ค€๋น„ํ•œ ํ›„ ๋‹ค์Œ๊ณผ ๊ฐ™์ด ์ž๋™ ํด๋ž˜์Šค์— ์ถ”๊ฐ€ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค:

from transformers import AutoConfig, AutoModel

AutoConfig.register("new-model", NewModelConfig)
AutoModel.register(NewModelConfig, NewModel)

์ดํ›„์—๋Š” ์ผ๋ฐ˜์ ์œผ๋กœ ์ž๋™ ํด๋ž˜์Šค๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์ฒ˜๋Ÿผ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค!

๋งŒ์•ฝ NewModelConfig๊ฐ€ [~transformers.PretrainedConfig]์˜ ์„œ๋ธŒํด๋ž˜์Šค๋ผ๋ฉด, ํ•ด๋‹น model_type ์†์„ฑ์ด ๋“ฑ๋กํ•  ๋•Œ ์‚ฌ์šฉํ•˜๋Š” ํ‚ค(์—ฌ๊ธฐ์„œ๋Š” "new-model")์™€ ๋™์ผํ•˜๊ฒŒ ์„ค์ •๋˜์–ด ์žˆ๋Š”์ง€ ํ™•์ธํ•˜์„ธ์š”.

๋งˆ์ฐฌ๊ฐ€์ง€๋กœ, NewModel์ด [PreTrainedModel]์˜ ์„œ๋ธŒํด๋ž˜์Šค๋ผ๋ฉด, ํ•ด๋‹น config_class ์†์„ฑ์ด ๋“ฑ๋กํ•  ๋•Œ ์‚ฌ์šฉํ•˜๋Š” ํด๋ž˜์Šค(์—ฌ๊ธฐ์„œ๋Š” NewModelConfig)์™€ ๋™์ผํ•˜๊ฒŒ ์„ค์ •๋˜์–ด ์žˆ๋Š”์ง€ ํ™•์ธํ•˜์„ธ์š”.

AutoConfig[[transformers.AutoConfig]]

[[autodoc]] AutoConfig

AutoTokenizer[[transformers.AutoTokenizer]]

[[autodoc]] AutoTokenizer

AutoFeatureExtractor[[transformers.AutoFeatureExtractor]]

[[autodoc]] AutoFeatureExtractor

AutoImageProcessor[[transformers.AutoImageProcessor]]

[[autodoc]] AutoImageProcessor

AutoProcessor[[transformers.AutoProcessor]]

[[autodoc]] AutoProcessor

์ผ๋ฐ˜์ ์ธ ๋ชจ๋ธ ํด๋ž˜์Šค[[generic-model-classes]]

๋‹ค์Œ ์ž๋™ ํด๋ž˜์Šค๋“ค์€ ํŠน์ • ํ—ค๋“œ ์—†์ด ๊ธฐ๋ณธ ๋ชจ๋ธ ํด๋ž˜์Šค๋ฅผ ์ธ์Šคํ„ด์Šคํ™”ํ•˜๋Š” ๋ฐ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

AutoModel[[transformers.AutoModel]]

[[autodoc]] AutoModel

TFAutoModel[[transformers.TFAutoModel]]

[[autodoc]] TFAutoModel

FlaxAutoModel[[transformers.FlaxAutoModel]]

[[autodoc]] FlaxAutoModel

์ผ๋ฐ˜์ ์ธ ์‚ฌ์ „ ํ•™์Šต ํด๋ž˜์Šค[[generic-pretraining-classes]]

๋‹ค์Œ ์ž๋™ ํด๋ž˜์Šค๋“ค์€ ์‚ฌ์ „ ํ›ˆ๋ จ ํ—ค๋“œ๊ฐ€ ํฌํ•จ๋œ ๋ชจ๋ธ์„ ์ธ์Šคํ„ด์Šคํ™”ํ•˜๋Š” ๋ฐ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

AutoModelForPreTraining[[transformers.AutoModelForPreTraining]]

[[autodoc]] AutoModelForPreTraining

TFAutoModelForPreTraining[[transformers.TFAutoModelForPreTraining]]

[[autodoc]] TFAutoModelForPreTraining

FlaxAutoModelForPreTraining[[transformers.FlaxAutoModelForPreTraining]]

[[autodoc]] FlaxAutoModelForPreTraining

์ž์—ฐ์–ด ์ฒ˜๋ฆฌ[[natural-language-processing]]

๋‹ค์Œ ์ž๋™ ํด๋ž˜์Šค๋“ค์€ ์•„๋ž˜์˜ ์ž์—ฐ์–ด ์ฒ˜๋ฆฌ ์ž‘์—…์— ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

AutoModelForCausalLM[[transformers.AutoModelForCausalLM]]

[[autodoc]] AutoModelForCausalLM

TFAutoModelForCausalLM[[transformers.TFAutoModelForCausalLM]]

[[autodoc]] TFAutoModelForCausalLM

FlaxAutoModelForCausalLM[[transformers.FlaxAutoModelForCausalLM]]

[[autodoc]] FlaxAutoModelForCausalLM

AutoModelForMaskedLM[[transformers.AutoModelForMaskedLM]]

[[autodoc]] AutoModelForMaskedLM

TFAutoModelForMaskedLM[[transformers.TFAutoModelForMaskedLM]]

[[autodoc]] TFAutoModelForMaskedLM

FlaxAutoModelForMaskedLM[[transformers.FlaxAutoModelForMaskedLM]]

[[autodoc]] FlaxAutoModelForMaskedLM

AutoModelForMaskGeneration[[transformers.AutoModelForMaskGeneration]]

[[autodoc]] AutoModelForMaskGeneration

TFAutoModelForMaskGeneration[[transformers.TFAutoModelForMaskGeneration]]

[[autodoc]] TFAutoModelForMaskGeneration

AutoModelForSeq2SeqLM[[transformers.AutoModelForSeq2SeqLM]]

[[autodoc]] AutoModelForSeq2SeqLM

TFAutoModelForSeq2SeqLM[[transformers.TFAutoModelForSeq2SeqLM]]

[[autodoc]] TFAutoModelForSeq2SeqLM

FlaxAutoModelForSeq2SeqLM[[transformers.FlaxAutoModelForSeq2SeqLM]]

[[autodoc]] FlaxAutoModelForSeq2SeqLM

AutoModelForSequenceClassification[[transformers.AutoModelForSequenceClassification]]

[[autodoc]] AutoModelForSequenceClassification

TFAutoModelForSequenceClassification[[transformers.TFAutoModelForSequenceClassification]]

[[autodoc]] TFAutoModelForSequenceClassification

FlaxAutoModelForSequenceClassification[[transformers.FlaxAutoModelForSequenceClassification]]

[[autodoc]] FlaxAutoModelForSequenceClassification

AutoModelForMultipleChoice[[transformers.AutoModelForMultipleChoice]]

[[autodoc]] AutoModelForMultipleChoice

TFAutoModelForMultipleChoice[[transformers.TFAutoModelForMultipleChoice]]

[[autodoc]] TFAutoModelForMultipleChoice

FlaxAutoModelForMultipleChoice[[transformers.FlaxAutoModelForMultipleChoice]]

[[autodoc]] FlaxAutoModelForMultipleChoice

AutoModelForNextSentencePrediction[[transformers.AutoModelForNextSentencePrediction]]

[[autodoc]] AutoModelForNextSentencePrediction

TFAutoModelForNextSentencePrediction[[transformers.TFAutoModelForNextSentencePrediction]]

[[autodoc]] TFAutoModelForNextSentencePrediction

FlaxAutoModelForNextSentencePrediction[[transformers.FlaxAutoModelForNextSentencePrediction]]

[[autodoc]] FlaxAutoModelForNextSentencePrediction

AutoModelForTokenClassification[[transformers.AutoModelForTokenClassification]]

[[autodoc]] AutoModelForTokenClassification

TFAutoModelForTokenClassification[[transformers.TFAutoModelForTokenClassification]]

[[autodoc]] TFAutoModelForTokenClassification

FlaxAutoModelForTokenClassification[[transformers.FlaxAutoModelForTokenClassification]]

[[autodoc]] FlaxAutoModelForTokenClassification

AutoModelForQuestionAnswering[[transformers.AutoModelForQuestionAnswering]]

[[autodoc]] AutoModelForQuestionAnswering

TFAutoModelForQuestionAnswering[[transformers.TFAutoModelForQuestionAnswering]]

[[autodoc]] TFAutoModelForQuestionAnswering

FlaxAutoModelForQuestionAnswering[[transformers.FlaxAutoModelForQuestionAnswering]]

[[autodoc]] FlaxAutoModelForQuestionAnswering

AutoModelForTextEncoding[[transformers.AutoModelForTextEncoding]]

[[autodoc]] AutoModelForTextEncoding

TFAutoModelForTextEncoding[[transformers.TFAutoModelForTextEncoding]]

[[autodoc]] TFAutoModelForTextEncoding

์ปดํ“จํ„ฐ ๋น„์ „[[computer-vision]]

๋‹ค์Œ ์ž๋™ ํด๋ž˜์Šค๋“ค์€ ์•„๋ž˜์˜ ์ปดํ“จํ„ฐ ๋น„์ „ ์ž‘์—…์— ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

AutoModelForDepthEstimation[[transformers.AutoModelForDepthEstimation]]

[[autodoc]] AutoModelForDepthEstimation

AutoModelForImageClassification[[transformers.AutoModelForImageClassification]]

[[autodoc]] AutoModelForImageClassification

TFAutoModelForImageClassification[[transformers.TFAutoModelForImageClassification]]

[[autodoc]] TFAutoModelForImageClassification

FlaxAutoModelForImageClassification[[transformers.FlaxAutoModelForImageClassification]]

[[autodoc]] FlaxAutoModelForImageClassification

AutoModelForVideoClassification[[transformers.AutoModelForVideoClassification]]

[[autodoc]] AutoModelForVideoClassification

AutoModelForKeypointDetection[[transformers.AutoModelForKeypointDetection]]

[[autodoc]] AutoModelForKeypointDetection

AutoModelForMaskedImageModeling[[transformers.AutoModelForMaskedImageModeling]]

[[autodoc]] AutoModelForMaskedImageModeling

TFAutoModelForMaskedImageModeling[[transformers.TFAutoModelForMaskedImageModeling]]

[[autodoc]] TFAutoModelForMaskedImageModeling

AutoModelForObjectDetection[[transformers.AutoModelForObjectDetection]]

[[autodoc]] AutoModelForObjectDetection

AutoModelForImageSegmentation[[transformers.AutoModelForImageSegmentation]]

[[autodoc]] AutoModelForImageSegmentation

AutoModelForImageToImage[[transformers.AutoModelForImageToImage]]

[[autodoc]] AutoModelForImageToImage

AutoModelForSemanticSegmentation[[transformers.AutoModelForSemanticSegmentation]]

[[autodoc]] AutoModelForSemanticSegmentation

TFAutoModelForSemanticSegmentation[[transformers.TFAutoModelForSemanticSegmentation]]

[[autodoc]] TFAutoModelForSemanticSegmentation

AutoModelForInstanceSegmentation[[transformers.AutoModelForInstanceSegmentation]]

[[autodoc]] AutoModelForInstanceSegmentation

AutoModelForUniversalSegmentation[[transformers.AutoModelForUniversalSegmentation]]

[[autodoc]] AutoModelForUniversalSegmentation

AutoModelForZeroShotImageClassification[[transformers.AutoModelForZeroShotImageClassification]]

[[autodoc]] AutoModelForZeroShotImageClassification

TFAutoModelForZeroShotImageClassification[[transformers.TFAutoModelForZeroShotImageClassification]]

[[autodoc]] TFAutoModelForZeroShotImageClassification

AutoModelForZeroShotObjectDetection[[transformers.AutoModelForZeroShotObjectDetection]]

[[autodoc]] AutoModelForZeroShotObjectDetection

์˜ค๋””์˜ค[[audio]]

๋‹ค์Œ ์ž๋™ ํด๋ž˜์Šค๋“ค์€ ์•„๋ž˜์˜ ์˜ค๋””์˜ค ์ž‘์—…์— ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

AutoModelForAudioClassification[[transformers.AutoModelForAudioClassification]]

[[autodoc]] AutoModelForAudioClassification

TFAutoModelForAudioClassification[[transformers.TFAutoModelForAudioClassification]]

[[autodoc]] TFAutoModelForAudioClassification

AutoModelForAudioFrameClassification[[transformers.AutoModelForAudioFrameClassification]]

[[autodoc]] AutoModelForAudioFrameClassification

AutoModelForCTC[[transformers.AutoModelForCTC]]

[[autodoc]] AutoModelForCTC

AutoModelForSpeechSeq2Seq[[transformers.AutoModelForSpeechSeq2Seq]]

[[autodoc]] AutoModelForSpeechSeq2Seq

TFAutoModelForSpeechSeq2Seq[[transformers.TFAutoModelForSpeechSeq2Seq]]

[[autodoc]] TFAutoModelForSpeechSeq2Seq

FlaxAutoModelForSpeechSeq2Seq[[transformers.FlaxAutoModelForSpeechSeq2Seq]]

[[autodoc]] FlaxAutoModelForSpeechSeq2Seq

AutoModelForAudioXVector[[transformers.AutoModelForAudioXVector]]

[[autodoc]] AutoModelForAudioXVector

AutoModelForTextToSpectrogram[[transformers.AutoModelForTextToSpectrogram]]

[[autodoc]] AutoModelForTextToSpectrogram

AutoModelForTextToWaveform[[transformers.AutoModelForTextToWaveform]]

[[autodoc]] AutoModelForTextToWaveform

๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ[[multimodal]]

๋‹ค์Œ ์ž๋™ ํด๋ž˜์Šค๋“ค์€ ์•„๋ž˜์˜ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์ž‘์—…์— ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

AutoModelForTableQuestionAnswering[[transformers.AutoModelForTableQuestionAnswering]]

[[autodoc]] AutoModelForTableQuestionAnswering

TFAutoModelForTableQuestionAnswering[[transformers.TFAutoModelForTableQuestionAnswering]]

[[autodoc]] TFAutoModelForTableQuestionAnswering

AutoModelForDocumentQuestionAnswering[[transformers.AutoModelForDocumentQuestionAnswering]]

[[autodoc]] AutoModelForDocumentQuestionAnswering

TFAutoModelForDocumentQuestionAnswering[[transformers.TFAutoModelForDocumentQuestionAnswering]]

[[autodoc]] TFAutoModelForDocumentQuestionAnswering

AutoModelForVisualQuestionAnswering[[transformers.AutoModelForVisualQuestionAnswering]]

[[autodoc]] AutoModelForVisualQuestionAnswering

AutoModelForVision2Seq[[transformers.AutoModelForVision2Seq]]

[[autodoc]] AutoModelForVision2Seq

TFAutoModelForVision2Seq[[transformers.TFAutoModelForVision2Seq]]

[[autodoc]] TFAutoModelForVision2Seq

FlaxAutoModelForVision2Seq[[transformers.FlaxAutoModelForVision2Seq]]

[[autodoc]] FlaxAutoModelForVision2Seq