A newer version of this model is available: kakaocorp/kanana-1.5-2.1b-instruct-2505

Kanana-1.5 2.1B ๋ชจ๋ธ์„ GRPO๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ Chain-of-Thought์„ ํ•˜๋„๋ก ํ•™์Šต์‹œํ‚จ ๋ชจ๋ธ ์ž…๋‹ˆ๋‹ค.

์•„๋ž˜ ๋ฐ์ดํ„ฐ ์…‹์„ ํ™œ์šฉํ•ด A100 40GB GPU๋กœ 3100 step (์•ฝ 8์‹œ๊ฐ„) ๋งŒํผ ํ•™์Šต ๋˜์—ˆ์Œ. https://huggingface.co/datasets/heegyu/CoT-collection-ko

(์ž์„ธํ•œ ๋‚ด์šฉ ์ถ”๊ฐ€ ์˜ˆ์ •)

System Prompt :

SYSTEM_PROMPT = (
    "์‚ฌ์šฉ์ž์™€ ์–ด์‹œ์Šคํ„ดํŠธ ๊ฐ„์˜ ๋Œ€ํ™”์ž…๋‹ˆ๋‹ค. ์‚ฌ์šฉ์ž๊ฐ€ ์งˆ๋ฌธ์„ ํ•˜๋ฉด ์–ด์‹œ์Šคํ„ดํŠธ๊ฐ€ ์ด๋ฅผ ํ•ด๊ฒฐํ•ฉ๋‹ˆ๋‹ค."
    "์–ด์‹œ์Šคํ„ดํŠธ๋Š” ๋จผ์ € ๋จธ๋ฆฟ์†์œผ๋กœ ์ถ”๋ก  ๊ณผ์ •์„ ์ƒ๊ฐํ•œ ๋‹ค์Œ ์‚ฌ์šฉ์ž์—๊ฒŒ ๋‹ต๋ณ€์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค."
    "์ถ”๋ก  ๊ณผ์ •๊ณผ ๋‹ต๋ณ€์€ ๊ฐ๊ฐ <์ƒ๊ฐ> </์ƒ๊ฐ> ๋ฐ <๋‹ต๋ณ€> </๋‹ต๋ณ€> ํƒœ๊ทธ ์•ˆ์— ๋ฌถ์—ฌ ์žˆ์Šต๋‹ˆ๋‹ค."
    "์˜ˆ์‹œ: <์ƒ๊ฐ> ์ถ”๋ก  ๋‚ด์šฉ </์ƒ๊ฐ> <๋‹ต๋ณ€> ์ตœ์ข… ๋‹ต๋ณ€ </๋‹ต๋ณ€>"
)

๋ชจ๋ธ ์ถœ๋ ฅ ์˜ˆ์‹œ :

User: ๋Œ€ํ•œ๋ฏผ๊ตญ๊ณผ ๋ฏธ๊ตญ์˜ ๋ฏธ๋ž˜ ๊ด€๊ณ„์— ๋Œ€ํ•ด ์ž์„ธํžˆ ์„œ์ˆ ํ•˜์‹œ์˜ค.
Assistant: <์ƒ๊ฐ> ๋Œ€ํ•œ๋ฏผ๊ตญ๊ณผ ๋ฏธ๊ตญ์˜ ๋ฏธ๋ž˜ ๊ด€๊ณ„๋Š” ์–‘๊ตญ ๊ฐ„ ์ •์น˜, ๊ฒฝ์ œ์  ์ƒํ˜ธ ์˜์กด๋„๊ฐ€ ๋†’์•„์งˆ์ˆ˜๋ก ๊ธด๋ฐ€ํ•ด์ง‘๋‹ˆ๋‹ค. ํŠนํžˆ ํ•œ๋ฏธ ์ž์œ ๋ฌด์—ญํ˜‘์ •(FTA) ๋“ฑ์œผ๋กœ ์ธํ•œ ๊ต๋ฅ˜ ํ™•๋Œ€์™€ ๊ตฐ์‚ฌ ๋™๋งน ๊ฐ•ํ™”๊ฐ€ ๋Œ€ํ‘œ์ ์ธ ์˜ˆ์ž…๋‹ˆ๋‹ค.
<์ƒ๊ฐ> ๋˜ํ•œ ๋ฏธ๊ตญ์ด ๋ถํ•ต ๋ฌธ์ œ ํ•ด๊ฒฐ์„ ์œ„ํ•ด ๋Œ€ํ•œ๋ฏผ๊ตญ์— ํ˜‘๋ ฅ ์š”์ฒญํ•  ๊ฐ€๋Šฅ์„ฑ๋„ ์žˆ์Šต๋‹ˆ๋‹ค.

<๋‹ต๋ณ€> ๋Œ€ํ•œ๋ฏผ๊ตญ๊ณผ ๋ฏธ๊ตญ์˜ ๋ฏธ๋ž˜ ๊ด€๊ณ„๋Š” ์–‘๊ตญ ๊ฐ„ ์ •์น˜, ๊ฒฝ์ œ์  ์ƒํ˜ธ ์˜์กด๋„๊ฐ€ ๋†’์•„์งˆ์ˆ˜๋ก ๊ธด๋ฐ€ํ•ด์ง‘๋‹ˆ๋‹ค. ํŠนํžˆ ํ•œ๋ฏธ ์ž์œ ๋ฌด์—ญํ˜‘์ •(FTA) ๋“ฑ์œผ๋กœ ์ธ
ํ•œ ๊ต๋ฅ˜ ํ™•๋Œ€์™€ ๊ตฐ์‚ฌ ๋™๋งน ๊ฐ•ํ™”๊ฐ€ ๋Œ€ํ‘œ์ ์ธ ์˜ˆ์ž…๋‹ˆ๋‹ค.
๋˜ํ•œ ๋ถํ•ต ๋ฌธ์ œ ํ•ด๊ฒฐ์„ ์œ„ํ•ด ๋ฏธ๊ตญ์ด ๋Œ€ํ•œ๋ฏผ๊ตญ์— ํ˜‘๋ ฅ ์š”์ฒญํ•  ๊ฐ€๋Šฅ์„ฑ๋„ ์žˆ์Šต๋‹ˆ๋‹ค.
--
User: ์นด์นด์˜ค๋Š” ์–ด๋–ค ํšŒ์‚ฌ์•ผ?
Assistant: Llama.generate: 332 prefix-match hit, remaining 17 prompt tokens to eval
<์ƒ๊ฐ> ์นด์นด์˜ค๋Š” ๋Œ€ํ•œ๋ฏผ๊ตญ์˜ ๋Œ€ํ‘œ์ ์ธ IT ๊ธฐ์—… ์ค‘ ํ•˜๋‚˜์ž…๋‹ˆ๋‹ค.
- ๋ฉ”์‹ ์ €(์นดํ†ก) ์„œ๋น„์Šค๋ฅผ ํ†ตํ•ด ๊ตญ๋‚ด์™ธ์—์„œ ๋งค์šฐ ๋„๋ฆฌ ์‚ฌ์šฉ๋˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.
- ๋‹ค์–‘ํ•œ ํ”Œ๋žซํผ์„ ์šด์˜ํ•˜๊ณ  ์žˆ์–ด, ๋ชจ๋ฐ”์ผ ๊ฒŒ์ž„ยท์ฝ˜ํ…์ธ  ์ œ์ž‘ ๋“ฑ ์—ฌ๋Ÿฌ ๋ถ„์•ผ์— ์ง„์ถœํ•ด ์žˆ์Šต๋‹ˆ๋‹ค.

<๋‹ต๋ณ€> ์นด์นด์˜ค๋Š” ๋Œ€ํ•œ๋ฏผ๊ตญ์˜ ๋Œ€ํ‘œ์ ์ธ IT ๊ธฐ์—…์œผ๋กœ์„œ ๋ฉ”์‹ ์ €์ธ '์นดํ†ก' ์„œ๋น„์Šค๋ฅผ ์ค‘์‹ฌ์œผ๋กœ ์˜จ๋ผ์ธ ์ปค๋ฎค๋‹ˆ์ผ€์ด์…˜๊ณผ ์ฝ˜ํ…์ธ  ์‚ฐ์—… ์ „๋ฐ˜์—์„œ ํ™œ์•ฝํ•˜๋Š”
 ํšŒ์‚ฌ์ž…๋‹ˆ๋‹ค.

license: apache-2.0 datasets: - heegyu/CoT-collection-ko language: - ko base_model: - kakaocorp/kanana-1.5-2.1b-instruct-2505 base_model_relation: finetune pipeline_tag: text-generation tags: - grpo - cot

Downloads last month
1
GGUF
Model size
2B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support