DFveloper's picture
Update README.md
9f79749 verified
|
raw
history blame contribute delete
4.36 kB
metadata
base_model: google/gemma-4-26b-a4b-it
tags:
  - text-generation-inference
  - transformers
  - gemma4
license: apache-2.0
language:
  - ko

๐Ÿ‡ฐ๐Ÿ‡ท AIKAR 3 Pro (26B) - Specialist in Korean Reasoning

AIKAR 3 Pro๋Š” LOOP์—์„œ ๊ฐœ๋ฐœ๋œ 26B ๊ทœ๋ชจ์˜ ์–ธ์–ด ๋ชจ๋ธ๋กœ, ํŠนํžˆ ํ•œ๊ตญ์–ด ์ถ”๋ก (Reasoning) ๋Šฅ๋ ฅ ๊ทน๋Œ€ํ™”๋ฅผ ์œ„ํ•ด ์„ค๊ณ„๋˜์—ˆ์Šต๋‹ˆ๋‹ค. ์ผ๋ฐ˜์ ์ธ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM)์ด ๋ณด์—ฌ์ฃผ๋Š” ๋‹จ์ˆœ ์ •๋ณด ์ œ๊ณต์„ ๋„˜์–ด, ๋ณต์žกํ•œ ํ•œ๊ตญ์–ด ๋…ผ๋ฆฌ ๊ตฌ์กฐ ์ดํ•ด, ๋ฌธ๋งฅ์  ์ถ”๋ก , ๋‹ค๋‹จ๊ณ„ ์ˆ˜ํ•™ ๋ฐ ์ฝ”๋”ฉ ๋ฌธ์ œ๋ฅผ ํ•œ๊ตญ์–ด ๋งฅ๋ฝ์—์„œ ํ’€์–ด๋‚ด๋Š” ๋ฐ ์ตœ์ ํ™”๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค.

License Language Parameters

โœจ Key Features (ํ•ต์‹ฌ ๊ธฐ๋Šฅ)

  • Reasoning Focused (์ถ”๋ก  ์ค‘์‹ฌ): ๋‹จ์ˆœ ์ƒ์„ฑ ๋ชจ๋ธ์ด ์•„๋‹Œ, ๋…ผ๋ฆฌ์ ์ธ ๋‹จ๊ณ„(Chain-of-Thought)๋ฅผ ๊ฑฐ์ณ ๋‹ต์„ ๋„์ถœํ•˜๋Š” ์ถ”๋ก  ํŠนํ™” ์•„ํ‚คํ…์ฒ˜์ž…๋‹ˆ๋‹ค. ํ•œ๊ตญ์–ด ๋ฌธ๋งฅ์—์„œ ๋ฐœ์ƒํ•˜๋Š” ๋ฏธ๋ฌ˜ํ•œ ๋‰˜์•™์Šค๋ฅผ ๋…ผ๋ฆฌ ๊ตฌ์กฐ์— ๊ฒฐํ•ฉํ•ฉ๋‹ˆ๋‹ค.
  • Korean-Centric Dataset (ํ•œ๊ตญ์–ด ํŠนํ™”): ํ•œ๊ตญ์–ด์˜ ๋ฌธ๋ฒ•์  ํŠน์„ฑ, ๋ฌธํ™”์  ๋ฐฐ๊ฒฝ, ์ „๋ฌธ ์šฉ์–ด๋ฅผ ๊นŠ์ด ์žˆ๊ฒŒ ์ดํ•ดํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•œ๊ตญ์–ด ์ •์ œ ๋ฐ์ดํ„ฐ์…‹์„ ์ค‘์‹ฌ์œผ๋กœ ์‚ฌ์ „ ํ•™์Šต(Pre-training) ๋ฐ ๋ฏธ์„ธ ์กฐ์ •(Fine-tuning)๋˜์—ˆ์Šต๋‹ˆ๋‹ค.
  • Efficient 26B Architecture (26B ๊ทœ๋ชจ): ์ถ”๋ก  ๋Šฅ๋ ฅ์˜ ํšจ์œจ์„ฑ์„ ๊ทน๋Œ€ํ™”ํ•˜๊ธฐ ์œ„ํ•ด 26B ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ, ์ƒ๋Œ€์ ์œผ๋กœ ์ ์€ VRAM์œผ๋กœ๋„ ๊ณ ์„ฑ๋Šฅ์˜ CoT(Reasoning) ๊ฒฐ๊ณผ๋ฅผ ์–ป์„ ์ˆ˜ ์žˆ๋„๋ก ์ตœ์ ํ™”๋˜์—ˆ์Šต๋‹ˆ๋‹ค.
  • Multi-task Capabilities: ์ˆ˜ํ•™์  ์‚ฌ๊ณ , ํ”„๋กœ๊ทธ๋ž˜๋ฐ, ๋ฌธํ•™์  ์ถ”๋ก , ๋ฒ•๋ฅ  ๋ฐ ๊ธฐ์ˆ  ๋ฌธ์„œ ํ•ด์„ ๋“ฑ ๋‹ค์–‘ํ•œ ๊ณ ๋„ํ™”๋œ ์ž‘์—…์— ๋Šฅ์ˆ™ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ“Š ๋ชจ๋ธ ๊ตฌ์กฐ (Model Architecture)

AIKAR 3 Pro๋Š” ๋Œ€๊ทœ๋ชจ 26B ํŒŒ๋ผ๋ฏธํ„ฐ ๋ ˆ์ด์–ด๋ฅผ ๊ฐ€์ง„ ๋””์ฝ”๋” ์ „์šฉ ํŠธ๋žœ์Šคํฌ๋จธ(Decoder-only Transformer) ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. LOOP ๊ณ ์œ ์˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ํ†ตํ•ด ํ•œ๊ตญ์–ด ํ† ํฐ ์ฒ˜๋ฆฌ ํšจ์œจ์„ 40% ์ด์ƒ ํ–ฅ์ƒ์‹œ์ผœ, ๊ธด ๋ฌธ๋งฅ(Context window)์—์„œ๋„ ์ถ”๋ก  ์ผ๊ด€์„ฑ์„ ์œ ์ง€ํ•ฉ๋‹ˆ๋‹ค.

  • ํŒŒ๋ผ๋ฏธํ„ฐ: 26 Billion
  • Context Window: 32k tokens
  • Focus: Korean Language Understanding, Logical Reasoning, Mathematical Solving

๐Ÿ› ๏ธ ํ•™์Šต ๊ณผ์ • (Training Process)

AIKAR 3 Pro๋Š” ๋‹ค์Œ ์„ธ ๊ฐ€์ง€ ๋‹จ๊ณ„๋ฅผ ๊ฑฐ์ณ ์™„์„ฑ๋˜์—ˆ์Šต๋‹ˆ๋‹ค:

  1. Advanced Pre-training: ๋ฐฉ๋Œ€ํ•œ ํ•œ๊ตญ์–ด ๋ฌธ๋ฒ• ๊ต์žฌ, ๋‰ด์Šค, ์ „๋ฌธ ์„œ์  ๋ฐ ๊ณต๊ฐœ ์›น ๋ฐ์ดํ„ฐ๋ฅผ ํ•™์Šตํ•˜์˜€์Šต๋‹ˆ๋‹ค.
  2. Supervised Fine-Tuning (SFT): ์ •๊ตํ•˜๊ฒŒ ์„ค๊ณ„๋œ ํ•œ๊ตญ์–ด ์ถ”๋ก  ํŠœํ† ๋ฆฌ์–ผ ๋ฐ์ดํ„ฐ์…‹์„ ํ•™์Šตํ•˜์—ฌ ์ƒ๊ฐ์˜ ํ๋ฆ„(Thought chain)์„ ํ˜•์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค.
  3. Reasoning Reinforcement Learning: ์ธ๊ฐ„์˜ ์„ ํ˜ธ๋„๋ฅผ ๋ฐ˜์˜ํ•œ ํ•œ๊ตญ์–ด ๋…ผ๋ฆฌ ๊ฒ€์ฆ ๋ฃจํ”„๋ฅผ ํ†ตํ•ด, ๋‹จ์ˆœํžˆ ๋‹ต๋งŒ ๋‚ด๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๋ผ ๋…ผ๋ฆฌ์ ์œผ๋กœ ํƒ€๋‹นํ•œ ์„ค๋ช…(Rationales)์„ ์ œ๊ณตํ•˜๋„๋ก ์ตœ์ ํ™”๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

๐Ÿš€ ์‹œ์ž‘ํ•˜๊ธฐ (Getting Started)

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "DFveloper/AIKAR-3-Pro-unquantized"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto")

prompt = """๋ฌธ์ œ๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค. 37%์˜ ํ• ์ธ์œจ์„ ์ ์šฉํ•œ ์ƒํ’ˆ์ด 15,000์›์ผ ๋•Œ, ์›๋ž˜ ๊ฐ€๊ฒฉ์€ ์–ผ๋งˆ์ž…๋‹ˆ๊นŒ? ๋‹จ๊ณ„๋ณ„๋กœ ๋…ผ๋ฆฌ์ ์œผ๋กœ ์„ค๋ช…ํ•˜์„ธ์š”."""
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")

outputs = model.generate(**inputs, max_new_tokens=512, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

๐Ÿ“‹ ๋ชจ๋ธ ์‚ฌ์šฉ ์ฃผ์˜์‚ฌํ•ญ (Usage Note)

AIKAR 3 Pro๋Š” ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ๊ฐ•ํ™”ํ•œ ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ ๋‹ต์„ ๋ฐ”๋กœ ์–ป๊ธฐ๋ณด๋‹ค, "๋‹จ๊ณ„์ ์œผ๋กœ ์„ค๋ช…ํ•ด์ค˜(Let's think step by step)" ๋˜๋Š” "๋…ผ๋ฆฌ์  ๊ณผ์ •์„ ์ƒ์„ธํžˆ ์ ์–ด์ค˜"์™€ ๊ฐ™์€ ํ”„๋กฌํ”„ํŠธ๋ฅผ ์‚ฌ์šฉํ•  ๋•Œ ์ตœ์ƒ์˜ ์„ฑ๋Šฅ์„ ๋ฐœํœ˜ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿค ์ปค๋ฎค๋‹ˆํ‹ฐ ๋ฐ ์—ฐ๋ฝ์ฒ˜

  • Developers: LOOP Research Team
  • Homepage: loop.ai (์˜ˆ์‹œ ๋งํฌ)
  • Report issues: [Github Issue Link]

ยฉ 2026 LOOP AIKAR Laboratory. All Rights Reserved.