DrDavis's picture
Upload folder using huggingface_hub
17c6d62 verified

Gemma [[gemma]]

๊ฐœ์š” [[overview]]

Gemma ๋ชจ๋ธ์€ Google์˜ Gemma ํŒ€์ด ์ž‘์„ฑํ•œ Gemma: Open Models Based on Gemini Technology and Research์—์„œ ์ œ์•ˆ๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

Gemma ๋ชจ๋ธ์€ 6์กฐ ํ† ํฐ์œผ๋กœ ํ•™์Šต๋˜์—ˆ์œผ๋ฉฐ, 2b์™€ 7b์˜ ๋‘ ๊ฐ€์ง€ ๋ฒ„์ „์œผ๋กœ ์ถœ์‹œ๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

๋…ผ๋ฌธ์˜ ์ดˆ๋ก์€ ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค:

์ด ์—ฐ๊ตฌ๋Š” ์–ธ์–ด ์ดํ•ด, ์ถ”๋ก  ๋ฐ ์•ˆ์ „์„ฑ์— ๋Œ€ํ•œ ํ•™์ˆ  ๋ฒค์น˜๋งˆํฌ์—์„œ ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ์„ ๋ณด์ด๋Š” ์ƒˆ๋กœ์šด ์˜คํ”ˆ ์–ธ์–ด ๋ชจ๋ธ ๊ณ„์—ด์ธ Gemma๋ฅผ ์†Œ๊ฐœํ•ฉ๋‹ˆ๋‹ค. ์šฐ๋ฆฌ๋Š” ๋‘ ๊ฐ€์ง€ ํฌ๊ธฐ(20์–ต ๋ฐ 70์–ต ๋งค๊ฐœ๋ณ€์ˆ˜)์˜ ๋ชจ๋ธ์„ ์ถœ์‹œํ•˜๋ฉฐ, ์‚ฌ์ „ ํ•™์Šต๋œ ์ฒดํฌํฌ์ธํŠธ์™€ ๋ฏธ์„ธ ์กฐ์ •๋œ ์ฒดํฌํฌ์ธํŠธ๋ฅผ ๋ชจ๋‘ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. Gemma๋Š” 18๊ฐœ์˜ ํ…์ŠคํŠธ ๊ธฐ๋ฐ˜ ์ž‘์—… ์ค‘ 11๊ฐœ์—์„œ ์œ ์‚ฌํ•œ ํฌ๊ธฐ์˜ ์˜คํ”ˆ ๋ชจ๋ธ์„ ๋Šฅ๊ฐ€ํ•˜๋ฉฐ, ์šฐ๋ฆฌ๋Š” ๋ชจ๋ธ ๊ฐœ๋ฐœ์— ๋Œ€ํ•œ ์ƒ์„ธํ•œ ์„ค๋ช…๊ณผ ํ•จ๊ป˜ ์•ˆ์ „์„ฑ๊ณผ ์ฑ…์ž„ ์ธก๋ฉด์— ๋Œ€ํ•œ ์ข…ํ•ฉ์ ์ธ ํ‰๊ฐ€๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. ์šฐ๋ฆฌ๋Š” LLM์˜ ์ฑ…์ž„๊ฐ ์žˆ๋Š” ๊ณต๊ฐœ๊ฐ€ ์ตœ์ฒจ๋‹จ ๋ชจ๋ธ์˜ ์•ˆ์ „์„ฑ์„ ํ–ฅ์ƒ์‹œํ‚ค๊ณ  ๋‹ค์Œ ์„ธ๋Œ€์˜ LLM ํ˜์‹ ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•˜๋Š” ๋ฐ ์ค‘์š”ํ•˜๋‹ค๊ณ  ๋ฏฟ์Šต๋‹ˆ๋‹ค.

ํŒ:

  • ์›๋ณธ ์ฒดํฌํฌ์ธํŠธ๋Š” ๋ณ€ํ™˜ ์Šคํฌ๋ฆฝํŠธ src/transformers/models/gemma/convert_gemma_weights_to_hf.py๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋ณ€ํ™˜ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

์ด ๋ชจ๋ธ์€ Arthur Zucker, Younes Belkada, Sanchit Gandhi, Pedro Cuenca๊ฐ€ ๊ธฐ์—ฌํ–ˆ์Šต๋‹ˆ๋‹ค.

GemmaConfig [[transformers.GemmaConfig]]

[[autodoc]] GemmaConfig

GemmaTokenizer [[transformers.GemmaTokenizer]]

[[autodoc]] GemmaTokenizer

GemmaTokenizerFast [[transformers.GemmaTokenizerFast]]

[[autodoc]] GemmaTokenizerFast

GemmaModel [[transformers.GemmaModel]]

[[autodoc]] GemmaModel - forward

GemmaForCausalLM [[transformers.GemmaForCausalLM]]

[[autodoc]] GemmaForCausalLM - forward

GemmaForSequenceClassification [[transformers.GemmaForSequenceClassification]]

[[autodoc]] GemmaForSequenceClassification - forward

GemmaForTokenClassification [[transformers.GemmaForTokenClassification]]

[[autodoc]] GemmaForTokenClassification - forward

FlaxGemmaModel [[transformers.FlaxGemmaModel]]

[[autodoc]] FlaxGemmaModel - call

FlaxGemmaForCausalLM [[transformers.FlaxGemmaForCausalLM]]

[[autodoc]] FlaxGemmaForCausalLM - call