somebody-to-love commited on
Commit
f45e7ff
ยท
verified ยท
1 Parent(s): 3d85abb

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +68 -3
README.md CHANGED
@@ -1,3 +1,68 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - ko
4
+ license: other
5
+ tags:
6
+ - llm
7
+ - korean
8
+ - orpo
9
+ - gguf
10
+ ---
11
+
12
+ # FRANKENSTALLM 3B v2 (Byte-Fallback Fixed)
13
+
14
+ ํ•œ๊ตญ์–ด ์ค‘์‹ฌ **FRANKENSTALLM 3B** ORPO ํŒŒ์ธํŠœ๋‹ ์ฒดํฌํฌ์ธํŠธ์— **byte-fallback ํ† ํฐ 256๊ฐœ**๋ฅผ ์ถ”๊ฐ€ํ•œ ๋ฒ„์ „์ž…๋‹ˆ๋‹ค.
15
+ llama.cpp/GGUF ์ถ”๋ก  ์‹œ ์ค„๋ฐ”๊ฟˆ(`\n`) ๋“ฑ ๋ฏธ๋“ฑ๋ก ๋ฌธ์ž๋กœ ์ธํ•œ ํฌ๋ž˜์‹œ๋ฅผ ๋ฐฉ์ง€ํ•˜๊ธฐ ์œ„ํ•ด ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.
16
+
17
+ ## ๋ชจ๋ธ ์ƒ์„ธ
18
+
19
+ | ํ•ญ๋ชฉ | ๊ฐ’ |
20
+ |------|-----|
21
+ | **Architecture** | LlamaForCausalLM |
22
+ | **Params** | ~3B |
23
+ | **Hidden size** | 2048 |
24
+ | **Layers** | 24 |
25
+ | **Attention heads** | 16 |
26
+ | **KV heads** | 4 |
27
+ | **Max position** | 4096 |
28
+ | **Vocab size** | **64,256** (64,000 + 256 byte-fallback) |
29
+ | **Training** | ORPO (SFT โ†’ ORPO) |
30
+
31
+ ## ๋ณ€๊ฒฝ ์‚ฌํ•ญ (v2)
32
+
33
+ - ํ† ํฌ๋‚˜์ด์ €: `byte_fallback=True`, `<0x00>`~`<0xFF>` 256๊ฐœ ํ† ํฐ ์ถ”๊ฐ€
34
+ - ์ž„๋ฒ ๋”ฉ: 64,000 โ†’ 64,256 ๋ฆฌ์‚ฌ์ด์ฆˆ, ์ƒˆ ํ† ํฐ ์ดˆ๊ธฐํ™”
35
+ - GGUF ๋ณ€ํ™˜ยทOllama ๋ฐฐํฌ ์‹œ ๋‰ด๋ผ์ธ ํฌํ•จ ์ž…๋ ฅ ์ •์ƒ ์ฒ˜๋ฆฌ ํ™•์ธ
36
+
37
+ ## ORPO ํ‰๊ฐ€ ์š”์•ฝ (๋™์ผ ์ฒดํฌํฌ์ธํŠธ ๊ธฐ์ค€)
38
+
39
+ - **ํ‰๊ฐ€ ์ผ์‹œ**: 2026-03-09
40
+ - **Preference Accuracy**: 76.02%
41
+ - **Reward Margin**: 0.6100
42
+ - **Eval Loss**: 1.7910 โ†’ 1.6250
43
+ - **KoBEST (0-shot) ํ‰๊ท **: 52.75%
44
+ - **์ƒ์„ฑ ํ’ˆ์งˆ**: Greedy 3-gram ๋ฐ˜๋ณต๋ฅ  30.89%, EOS ์ข…๋ฃŒ์œจ 66.67%
45
+ - **PPL Forgetting**: ์ตœ๋Œ€ 4.1% (๊ธฐ์ค€ <15%)
46
+ - **์ข…ํ•ฉ**: 7/10 ์ฐจ์› ํ†ต๊ณผ, ์ •๋Ÿ‰ ์Šค์ฝ”์–ด 63.7/100
47
+
48
+ ์ƒ์„ธ: ํ”„๋กœ์ ํŠธ ๋‚ด `reports/2026-03-09_ORPO_EVALUATION_REPORT.md` ์ฐธ๊ณ .
49
+
50
+ ## Ollama ๋ฐฐํฌ ๋ฒค์น˜๋งˆํฌ (Q4_K_M, 2026-03-09)
51
+
52
+ - **๋ชจ๋ธ๋ช…**: `frankenstallm-3b-v2`
53
+ - **ํ…Œ์ŠคํŠธ ์ˆ˜**: 35 (์ž๋™ 20 + ์ˆ˜๋™ 15)
54
+ - **์ž๋™ ์ฑ„์  ํ‰๊ท **: 46.7
55
+ - **์นดํ…Œ๊ณ ๋ฆฌ**: korean_nlu 100.0, reasoning 50.0, knowledge 75.0, instruction_following 66.7, code 0.0, safety 10.0, repetition_resistance 2.2 ๋“ฑ
56
+ - **์ง€์—ฐ**: Avg TTFT 16.7 ms, Avg TPS 142.5
57
+
58
+ ์ƒ์„ธ: `reports/2026-03-09_GGUF_DEPLOYMENT_AND_EVAL_REPORT.md`, `eval/results/frankenstallm-3b-v2/ollama_benchmark_summary.md`
59
+
60
+ ## ์‚ฌ์šฉ
61
+
62
+ - **Transformers**: ์ด ์ฒดํฌํฌ์ธํŠธ๋ฅผ ๊ทธ๋Œ€๋กœ `from_pretrained(...)` ๋กœ ๋กœ๋“œ ๊ฐ€๋Šฅ.
63
+ - **GGUF**: `scripts/fix_tokenizer_byte_fallback.py` ์ ์šฉ ํ›„ `convert_hf_to_gguf.py` โ†’ `llama-quantize` ๋กœ ๋ณ€ํ™˜ํ•œ v2 ํŒŒ์ดํ”„๋ผ์ธ ์‚ฌ์šฉ ๊ถŒ์žฅ.
64
+ ์ด๋ฏธ ๋ณ€ํ™˜๋œ Q4_K_M GGUF๋Š” Ollama์—์„œ `frankenstallm-3b-v2` ๋กœ ๋ฐฐํฌ ๊ฐ€๋Šฅ.
65
+
66
+ ## ๋ผ์ด์„ ์Šค
67
+
68
+ ํ”„๋กœ์ ํŠธ(FRANKENSTALLM) ๋ผ์ด์„ ์Šค์— ๋”ฐ๋ฆ…๋‹ˆ๋‹ค.