--- base_model: google/gemma-4-26b-a4b-it tags: - text-generation-inference - transformers - gemma4 license: apache-2.0 language: - ko --- # ๐Ÿ‡ฐ๐Ÿ‡ท AIKAR 3 Pro (26B) - Specialist in Korean Reasoning **AIKAR 3 Pro**๋Š” LOOP์—์„œ ๊ฐœ๋ฐœ๋œ 26B ๊ทœ๋ชจ์˜ ์–ธ์–ด ๋ชจ๋ธ๋กœ, ํŠนํžˆ **ํ•œ๊ตญ์–ด ์ถ”๋ก (Reasoning) ๋Šฅ๋ ฅ ๊ทน๋Œ€ํ™”**๋ฅผ ์œ„ํ•ด ์„ค๊ณ„๋˜์—ˆ์Šต๋‹ˆ๋‹ค. ์ผ๋ฐ˜์ ์ธ ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ(LLM)์ด ๋ณด์—ฌ์ฃผ๋Š” ๋‹จ์ˆœ ์ •๋ณด ์ œ๊ณต์„ ๋„˜์–ด, ๋ณต์žกํ•œ ํ•œ๊ตญ์–ด ๋…ผ๋ฆฌ ๊ตฌ์กฐ ์ดํ•ด, ๋ฌธ๋งฅ์  ์ถ”๋ก , ๋‹ค๋‹จ๊ณ„ ์ˆ˜ํ•™ ๋ฐ ์ฝ”๋”ฉ ๋ฌธ์ œ๋ฅผ ํ•œ๊ตญ์–ด ๋งฅ๋ฝ์—์„œ ํ’€์–ด๋‚ด๋Š” ๋ฐ ์ตœ์ ํ™”๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค. [![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0) [![Language](https://img.shields.io/badge/Language-Korean%20Priority-red)](https://github.com/AIKAR_LLM) [![Parameters](https://img.shields.io/badge/Params-26B-green)](https://loop.ai) ## โœจ Key Features (ํ•ต์‹ฌ ๊ธฐ๋Šฅ) - **Reasoning Focused (์ถ”๋ก  ์ค‘์‹ฌ):** ๋‹จ์ˆœ ์ƒ์„ฑ ๋ชจ๋ธ์ด ์•„๋‹Œ, ๋…ผ๋ฆฌ์ ์ธ ๋‹จ๊ณ„(Chain-of-Thought)๋ฅผ ๊ฑฐ์ณ ๋‹ต์„ ๋„์ถœํ•˜๋Š” ์ถ”๋ก  ํŠนํ™” ์•„ํ‚คํ…์ฒ˜์ž…๋‹ˆ๋‹ค. ํ•œ๊ตญ์–ด ๋ฌธ๋งฅ์—์„œ ๋ฐœ์ƒํ•˜๋Š” ๋ฏธ๋ฌ˜ํ•œ ๋‰˜์•™์Šค๋ฅผ ๋…ผ๋ฆฌ ๊ตฌ์กฐ์— ๊ฒฐํ•ฉํ•ฉ๋‹ˆ๋‹ค. - **Korean-Centric Dataset (ํ•œ๊ตญ์–ด ํŠนํ™”):** ํ•œ๊ตญ์–ด์˜ ๋ฌธ๋ฒ•์  ํŠน์„ฑ, ๋ฌธํ™”์  ๋ฐฐ๊ฒฝ, ์ „๋ฌธ ์šฉ์–ด๋ฅผ ๊นŠ์ด ์žˆ๊ฒŒ ์ดํ•ดํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•œ๊ตญ์–ด ์ •์ œ ๋ฐ์ดํ„ฐ์…‹์„ ์ค‘์‹ฌ์œผ๋กœ ์‚ฌ์ „ ํ•™์Šต(Pre-training) ๋ฐ ๋ฏธ์„ธ ์กฐ์ •(Fine-tuning)๋˜์—ˆ์Šต๋‹ˆ๋‹ค. - **Efficient 26B Architecture (26B ๊ทœ๋ชจ):** ์ถ”๋ก  ๋Šฅ๋ ฅ์˜ ํšจ์œจ์„ฑ์„ ๊ทน๋Œ€ํ™”ํ•˜๊ธฐ ์œ„ํ•ด 26B ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ, ์ƒ๋Œ€์ ์œผ๋กœ ์ ์€ VRAM์œผ๋กœ๋„ ๊ณ ์„ฑ๋Šฅ์˜ CoT(Reasoning) ๊ฒฐ๊ณผ๋ฅผ ์–ป์„ ์ˆ˜ ์žˆ๋„๋ก ์ตœ์ ํ™”๋˜์—ˆ์Šต๋‹ˆ๋‹ค. - **Multi-task Capabilities:** ์ˆ˜ํ•™์  ์‚ฌ๊ณ , ํ”„๋กœ๊ทธ๋ž˜๋ฐ, ๋ฌธํ•™์  ์ถ”๋ก , ๋ฒ•๋ฅ  ๋ฐ ๊ธฐ์ˆ  ๋ฌธ์„œ ํ•ด์„ ๋“ฑ ๋‹ค์–‘ํ•œ ๊ณ ๋„ํ™”๋œ ์ž‘์—…์— ๋Šฅ์ˆ™ํ•ฉ๋‹ˆ๋‹ค. ## ๐Ÿ“Š ๋ชจ๋ธ ๊ตฌ์กฐ (Model Architecture) AIKAR 3 Pro๋Š” ๋Œ€๊ทœ๋ชจ 26B ํŒŒ๋ผ๋ฏธํ„ฐ ๋ ˆ์ด์–ด๋ฅผ ๊ฐ€์ง„ ๋””์ฝ”๋” ์ „์šฉ ํŠธ๋žœ์Šคํฌ๋จธ(Decoder-only Transformer) ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. LOOP ๊ณ ์œ ์˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ํ†ตํ•ด ํ•œ๊ตญ์–ด ํ† ํฐ ์ฒ˜๋ฆฌ ํšจ์œจ์„ 40% ์ด์ƒ ํ–ฅ์ƒ์‹œ์ผœ, ๊ธด ๋ฌธ๋งฅ(Context window)์—์„œ๋„ ์ถ”๋ก  ์ผ๊ด€์„ฑ์„ ์œ ์ง€ํ•ฉ๋‹ˆ๋‹ค. - **ํŒŒ๋ผ๋ฏธํ„ฐ:** 26 Billion - **Context Window:** 32k tokens - **Focus:** Korean Language Understanding, Logical Reasoning, Mathematical Solving ## ๐Ÿ› ๏ธ ํ•™์Šต ๊ณผ์ • (Training Process) AIKAR 3 Pro๋Š” ๋‹ค์Œ ์„ธ ๊ฐ€์ง€ ๋‹จ๊ณ„๋ฅผ ๊ฑฐ์ณ ์™„์„ฑ๋˜์—ˆ์Šต๋‹ˆ๋‹ค: 1. **Advanced Pre-training:** ๋ฐฉ๋Œ€ํ•œ ํ•œ๊ตญ์–ด ๋ฌธ๋ฒ• ๊ต์žฌ, ๋‰ด์Šค, ์ „๋ฌธ ์„œ์  ๋ฐ ๊ณต๊ฐœ ์›น ๋ฐ์ดํ„ฐ๋ฅผ ํ•™์Šตํ•˜์˜€์Šต๋‹ˆ๋‹ค. 2. **Supervised Fine-Tuning (SFT):** ์ •๊ตํ•˜๊ฒŒ ์„ค๊ณ„๋œ ํ•œ๊ตญ์–ด ์ถ”๋ก  ํŠœํ† ๋ฆฌ์–ผ ๋ฐ์ดํ„ฐ์…‹์„ ํ•™์Šตํ•˜์—ฌ ์ƒ๊ฐ์˜ ํ๋ฆ„(Thought chain)์„ ํ˜•์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค. 3. **Reasoning Reinforcement Learning:** ์ธ๊ฐ„์˜ ์„ ํ˜ธ๋„๋ฅผ ๋ฐ˜์˜ํ•œ ํ•œ๊ตญ์–ด ๋…ผ๋ฆฌ ๊ฒ€์ฆ ๋ฃจํ”„๋ฅผ ํ†ตํ•ด, ๋‹จ์ˆœํžˆ ๋‹ต๋งŒ ๋‚ด๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๋ผ ๋…ผ๋ฆฌ์ ์œผ๋กœ ํƒ€๋‹นํ•œ ์„ค๋ช…(Rationales)์„ ์ œ๊ณตํ•˜๋„๋ก ์ตœ์ ํ™”๋˜์—ˆ์Šต๋‹ˆ๋‹ค. ## ๐Ÿš€ ์‹œ์ž‘ํ•˜๊ธฐ (Getting Started) ```python from transformers import AutoModelForCausalLM, AutoTokenizer model_id = "DFveloper/AIKAR-3-Pro-unquantized" tokenizer = AutoTokenizer.from_pretrained(model_id) model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto") prompt = """๋ฌธ์ œ๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค. 37%์˜ ํ• ์ธ์œจ์„ ์ ์šฉํ•œ ์ƒํ’ˆ์ด 15,000์›์ผ ๋•Œ, ์›๋ž˜ ๊ฐ€๊ฒฉ์€ ์–ผ๋งˆ์ž…๋‹ˆ๊นŒ? ๋‹จ๊ณ„๋ณ„๋กœ ๋…ผ๋ฆฌ์ ์œผ๋กœ ์„ค๋ช…ํ•˜์„ธ์š”.""" inputs = tokenizer(prompt, return_tensors="pt").to("cuda") outputs = model.generate(**inputs, max_new_tokens=512, temperature=0.7) print(tokenizer.decode(outputs[0], skip_special_tokens=True)) ``` ## ๐Ÿ“‹ ๋ชจ๋ธ ์‚ฌ์šฉ ์ฃผ์˜์‚ฌํ•ญ (Usage Note) AIKAR 3 Pro๋Š” ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ๊ฐ•ํ™”ํ•œ ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ ๋‹ต์„ ๋ฐ”๋กœ ์–ป๊ธฐ๋ณด๋‹ค, "๋‹จ๊ณ„์ ์œผ๋กœ ์„ค๋ช…ํ•ด์ค˜(Let's think step by step)" ๋˜๋Š” "๋…ผ๋ฆฌ์  ๊ณผ์ •์„ ์ƒ์„ธํžˆ ์ ์–ด์ค˜"์™€ ๊ฐ™์€ ํ”„๋กฌํ”„ํŠธ๋ฅผ ์‚ฌ์šฉํ•  ๋•Œ ์ตœ์ƒ์˜ ์„ฑ๋Šฅ์„ ๋ฐœํœ˜ํ•ฉ๋‹ˆ๋‹ค. ## ๐Ÿค ์ปค๋ฎค๋‹ˆํ‹ฐ ๋ฐ ์—ฐ๋ฝ์ฒ˜ - **Developers:** LOOP Research Team - **Homepage:** [loop.ai](https://loop.ai) (์˜ˆ์‹œ ๋งํฌ) - **Report issues:** [Github Issue Link] --- *ยฉ 2026 LOOP AIKAR Laboratory. All Rights Reserved.*