frankenstallm / README.md
somebody-to-love's picture
Upload README.md with huggingface_hub
f45e7ff verified
|
raw
history blame
2.38 kB
metadata
language:
  - ko
license: other
tags:
  - llm
  - korean
  - orpo
  - gguf

FRANKENSTALLM 3B v2 (Byte-Fallback Fixed)

ํ•œ๊ตญ์–ด ์ค‘์‹ฌ FRANKENSTALLM 3B ORPO ํŒŒ์ธํŠœ๋‹ ์ฒดํฌํฌ์ธํŠธ์— byte-fallback ํ† ํฐ 256๊ฐœ๋ฅผ ์ถ”๊ฐ€ํ•œ ๋ฒ„์ „์ž…๋‹ˆ๋‹ค.
llama.cpp/GGUF ์ถ”๋ก  ์‹œ ์ค„๋ฐ”๊ฟˆ(\n) ๋“ฑ ๋ฏธ๋“ฑ๋ก ๋ฌธ์ž๋กœ ์ธํ•œ ํฌ๋ž˜์‹œ๋ฅผ ๋ฐฉ์ง€ํ•˜๊ธฐ ์œ„ํ•ด ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.

๋ชจ๋ธ ์ƒ์„ธ

ํ•ญ๋ชฉ ๊ฐ’
Architecture LlamaForCausalLM
Params ~3B
Hidden size 2048
Layers 24
Attention heads 16
KV heads 4
Max position 4096
Vocab size 64,256 (64,000 + 256 byte-fallback)
Training ORPO (SFT โ†’ ORPO)

๋ณ€๊ฒฝ ์‚ฌํ•ญ (v2)

  • ํ† ํฌ๋‚˜์ด์ €: byte_fallback=True, <0x00>~`<0xFF>` 256๊ฐœ ํ† ํฐ ์ถ”๊ฐ€
  • ์ž„๋ฒ ๋”ฉ: 64,000 โ†’ 64,256 ๋ฆฌ์‚ฌ์ด์ฆˆ, ์ƒˆ ํ† ํฐ ์ดˆ๊ธฐํ™”
  • GGUF ๋ณ€ํ™˜ยทOllama ๋ฐฐํฌ ์‹œ ๋‰ด๋ผ์ธ ํฌํ•จ ์ž…๋ ฅ ์ •์ƒ ์ฒ˜๋ฆฌ ํ™•์ธ

ORPO ํ‰๊ฐ€ ์š”์•ฝ (๋™์ผ ์ฒดํฌํฌ์ธํŠธ ๊ธฐ์ค€)

  • ํ‰๊ฐ€ ์ผ์‹œ: 2026-03-09
  • Preference Accuracy: 76.02%
  • Reward Margin: 0.6100
  • Eval Loss: 1.7910 โ†’ 1.6250
  • KoBEST (0-shot) ํ‰๊ท : 52.75%
  • ์ƒ์„ฑ ํ’ˆ์งˆ: Greedy 3-gram ๋ฐ˜๋ณต๋ฅ  30.89%, EOS ์ข…๋ฃŒ์œจ 66.67%
  • PPL Forgetting: ์ตœ๋Œ€ 4.1% (๊ธฐ์ค€ <15%)
  • ์ข…ํ•ฉ: 7/10 ์ฐจ์› ํ†ต๊ณผ, ์ •๋Ÿ‰ ์Šค์ฝ”์–ด 63.7/100

์ƒ์„ธ: ํ”„๋กœ์ ํŠธ ๋‚ด reports/2026-03-09_ORPO_EVALUATION_REPORT.md ์ฐธ๊ณ .

Ollama ๋ฐฐํฌ ๋ฒค์น˜๋งˆํฌ (Q4_K_M, 2026-03-09)

  • ๋ชจ๋ธ๋ช…: frankenstallm-3b-v2
  • ํ…Œ์ŠคํŠธ ์ˆ˜: 35 (์ž๋™ 20 + ์ˆ˜๋™ 15)
  • ์ž๋™ ์ฑ„์  ํ‰๊ท : 46.7
  • ์นดํ…Œ๊ณ ๋ฆฌ: korean_nlu 100.0, reasoning 50.0, knowledge 75.0, instruction_following 66.7, code 0.0, safety 10.0, repetition_resistance 2.2 ๋“ฑ
  • ์ง€์—ฐ: Avg TTFT 16.7 ms, Avg TPS 142.5

์ƒ์„ธ: reports/2026-03-09_GGUF_DEPLOYMENT_AND_EVAL_REPORT.md, eval/results/frankenstallm-3b-v2/ollama_benchmark_summary.md

์‚ฌ์šฉ

  • Transformers: ์ด ์ฒดํฌํฌ์ธํŠธ๋ฅผ ๊ทธ๋Œ€๋กœ from_pretrained(...) ๋กœ ๋กœ๋“œ ๊ฐ€๋Šฅ.
  • GGUF: scripts/fix_tokenizer_byte_fallback.py ์ ์šฉ ํ›„ convert_hf_to_gguf.py โ†’ llama-quantize ๋กœ ๋ณ€ํ™˜ํ•œ v2 ํŒŒ์ดํ”„๋ผ์ธ ์‚ฌ์šฉ ๊ถŒ์žฅ.
    ์ด๋ฏธ ๋ณ€ํ™˜๋œ Q4_K_M GGUF๋Š” Ollama์—์„œ frankenstallm-3b-v2 ๋กœ ๋ฐฐํฌ ๊ฐ€๋Šฅ.

๋ผ์ด์„ ์Šค

ํ”„๋กœ์ ํŠธ(FRANKENSTALLM) ๋ผ์ด์„ ์Šค์— ๋”ฐ๋ฆ…๋‹ˆ๋‹ค.