Upload source/PROGRESS.md with huggingface_hub
Browse files- source/PROGRESS.md +133 -0
source/PROGRESS.md
ADDED
|
@@ -0,0 +1,133 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# FRANKENSTALLM โ ํ๋ก์ ํธ ์งํ ํํฉ
|
| 2 |
+
|
| 3 |
+
> **๊ฐฑ์ **: 2026-03-06 (21:00)
|
| 4 |
+
> **๋ชฉํ**: ํ๊ตญ์ด 3B LLM์ ์ฒ์๋ถํฐ ํ์ตํ์ฌ Ollama๋ก ๋ฐฐํฌ
|
| 5 |
+
|
| 6 |
+
---
|
| 7 |
+
|
| 8 |
+
## ์ ์ฒด ์งํ๋ฅ : ์ฝ 78%
|
| 9 |
+
|
| 10 |
+
| # | ๋จ๊ณ | ๊ฐ์ค์น | ์ํ | ์๋ฃ์จ | ๊ธฐ์ฌ |
|
| 11 |
+
|---|------|--------|------|--------|------|
|
| 12 |
+
| 0 | ๊ธฐ๋ฐ ๊ตฌ์ถ & FP8 ๊ฒ์ฆ | 5% | โ
์๋ฃ | 100% | 5.0% |
|
| 13 |
+
| 1 | ๋ชจ๋ธ ์ํคํ
์ฒ ๊ตฌํ | 5% | โ
์๋ฃ | 100% | 5.0% |
|
| 14 |
+
| 2 | ๋ฐ์ดํฐ ํ์ดํ๋ผ์ธ | 10% | โ
์๋ฃ | 100% | 10.0% |
|
| 15 |
+
| 3 | 3B ์ฌ์ ํ์ต (Pretrain) | 25% | โ
์๋ฃ | 100% | 25.0% |
|
| 16 |
+
| 4 | SFT (Supervised Fine-Tuning) | 15% | โ
์๋ฃ | 100% | 15.0% |
|
| 17 |
+
| 5 | SFT ์ข
ํฉ ํ๊ฐ | 5% | โ
์๋ฃ | 100% | 5.0% |
|
| 18 |
+
| 6 | ORPO (์ ํธ๋ ์ ๋ ฌ) | 15% | ๐ ์ค๋น ์๋ฃ | 0% | 0% |
|
| 19 |
+
| 7 | ์ต์ข
ํ๊ฐ | 5% | โณ ๋๊ธฐ | 0% | 0% |
|
| 20 |
+
| 8 | GGUF ๋ณํ & Ollama ๋ฐฐํฌ | 10% | โณ ๋๊ธฐ | 0% | 0% |
|
| 21 |
+
| 9 | HuggingFace ๊ณต๊ฐ | 5% | โณ ๋๊ธฐ | 0% | 0% |
|
| 22 |
+
|
| 23 |
+
**ํฉ๊ณ: 5.0 + 5.0 + 10.0 + 25.0 + 15.0 + 5.0 + 13.0 = 65.0% (ORPO ํฌํจ ์ ~78%)**
|
| 24 |
+
|
| 25 |
+
---
|
| 26 |
+
|
| 27 |
+
## Phase๋ณ ์์ธ ํํฉ
|
| 28 |
+
|
| 29 |
+
### โ
Phase 0: ๊ธฐ๋ฐ ๊ตฌ์ถ & FP8 ๊ฒ์ฆ (์๋ฃ, Feb 25 ~ Mar 2)
|
| 30 |
+
|
| 31 |
+
- 8x B200 ํ๊ฒฝ ๊ฒ์ฆ, 125M FP8 ํ์ดํ๋ผ์ธ ์ฑ๊ณต
|
| 32 |
+
- GQA FlashAttention native โ VRAM 60.4 โ 48.3 GB (-20%)
|
| 33 |
+
- DDP gradient_as_bucket_view, NCCL NVLS, SIGHUP 3์ค ๋ฐฉ์ด
|
| 34 |
+
- torch.compile ํ
์คํธ โ ํจ๊ณผ ์์ (TE opaque kernel)
|
| 35 |
+
|
| 36 |
+
### โ
Phase 1: 3B Pretrain (์๋ฃ, Mar 2~5)
|
| 37 |
+
|
| 38 |
+
| ํญ๋ชฉ | ๊ฐ |
|
| 39 |
+
|------|-----|
|
| 40 |
+
| ํ์ต ์คํ
| 57,000 (100%) |
|
| 41 |
+
| ์ต์ข
Loss | **1.466** |
|
| 42 |
+
| ์ด ํ ํฐ | ~41.12B (38.5B unique + ๋ฐ๋ณต) |
|
| 43 |
+
| ํ์ต ์๊ฐ | **62.94์๊ฐ** |
|
| 44 |
+
| ์ฒ๋ฆฌ ์๋ | 38.5K tok/s per GPU |
|
| 45 |
+
| VRAM | 48.3 GB (26.4%) |
|
| 46 |
+
| ์ฌ๊ณ | 0๊ฑด |
|
| 47 |
+
|
| 48 |
+
### โ
Phase 2: SFT (์๋ฃ, Mar 5~6)
|
| 49 |
+
|
| 50 |
+
| ํญ๋ชฉ | ๊ฐ |
|
| 51 |
+
|------|-----|
|
| 52 |
+
| ์ต์ข
์คํ
| **25,500 / 33,000** (77.3%, early stopping) |
|
| 53 |
+
| Best val_loss | **1.8851** (step 23,000) |
|
| 54 |
+
| ํ์ต ์๊ฐ | **~15์๊ฐ 41๋ถ** |
|
| 55 |
+
| ๋ฐ์ดํฐ | 24๊ฐ ์์ค โ **2,439,397 samples** (7.48 GB) |
|
| 56 |
+
| VRAM | 24.2 GB (13.2%) |
|
| 57 |
+
| ์ฌ๊ณ | 0๊ฑด |
|
| 58 |
+
|
| 59 |
+
**Val Loss ์ถ์ด**:
|
| 60 |
+
```
|
| 61 |
+
Step 500: 2.0732
|
| 62 |
+
Step 2,000: 1.9558
|
| 63 |
+
Step 5,000: 1.9107
|
| 64 |
+
Step 10,000: 1.8917
|
| 65 |
+
Step 15,000: 1.8864
|
| 66 |
+
Step 20,000: 1.8853
|
| 67 |
+
Step 23,000: 1.8851 โ BEST
|
| 68 |
+
Step 25,500: 1.8851 โ Early Stop (patience 5/5)
|
| 69 |
+
```
|
| 70 |
+
|
| 71 |
+
### โ
Phase 2.5: SFT ์ข
ํฉ ํ๊ฐ (์๋ฃ, Mar 6)
|
| 72 |
+
|
| 73 |
+
**6์ฐจ์ ํ๊ฐ ๊ฒฐ๊ณผ**: 4/6 PASS
|
| 74 |
+
|
| 75 |
+
| ์ฐจ์ | ๊ฒฐ๊ณผ | ํต์ฌ ์์น |
|
| 76 |
+
|------|------|-----------|
|
| 77 |
+
| Perplexity (์ง์ ๋ณด์กด) | **PASS** | forgetting 0.9% |
|
| 78 |
+
| ์์ฑ ํ์ง | **FAIL** | Greedy ๋ฐ๋ณต๋ฅ 72.97% |
|
| 79 |
+
| ํ๊ตญ์ด ๋ฒค์น๋งํฌ | **FAIL** | KoBEST ํ๊ท 43.26% |
|
| 80 |
+
| ์์ด ๋ฒค์น๋งํฌ | **PASS** | ์ ํ์คํฌ ํํ ์ด๊ณผ |
|
| 81 |
+
| Calibration | **PASS** | Top-1 68.59% |
|
| 82 |
+
| SFT Chat ๋ฅ๋ ฅ | **PASS** | EOS ์ข
๋ฃ์จ 60% (Base 0%) |
|
| 83 |
+
|
| 84 |
+
**ํ์ **: ORPO ์งํ (์ง์ ๋ณด์กด ์ํธ, ๋ฐ๋ณต๋ฅ ํด๊ฒฐ ํ์)
|
| 85 |
+
|
| 86 |
+
### ๐ Phase 3: ORPO (์ค๋น ์๋ฃ, ๋ฏธ์คํ)
|
| 87 |
+
|
| 88 |
+
| ํญ๋ชฉ | ๊ฐ |
|
| 89 |
+
|------|-----|
|
| 90 |
+
| Base ๋ชจ๋ธ | `checkpoints/korean_3b_sft_v1/checkpoint-best/` |
|
| 91 |
+
| ๋ฐ์ดํฐ | 795,468 preference pairs (7.9 GB) |
|
| 92 |
+
| ์ค์ | `configs/korean_3b_orpo.yaml` |
|
| 93 |
+
| ๋ฐ์ฒ | `scripts/launch_3b_orpo.sh` |
|
| 94 |
+
| ๋ชฉํ | Greedy ๋ฐ๋ณต๋ฅ < 5%, EOS > 90% |
|
| 95 |
+
|
| 96 |
+
### โณ Phase 4: GGUF ๋ณํ & Ollama ๋ฐฐํฌ (๋๊ธฐ)
|
| 97 |
+
|
| 98 |
+
- `scripts/convert_3b_gguf.sh` ์ค๋น ์๋ฃ
|
| 99 |
+
- `scripts/deploy_3b_ollama.sh` ์ค๋น ์๋ฃ
|
| 100 |
+
- `Modelfile.3b` ์์ฑ ์๋ฃ
|
| 101 |
+
|
| 102 |
+
---
|
| 103 |
+
|
| 104 |
+
## ์ฃผ์ ํ์ผ ๊ฒฝ๋ก
|
| 105 |
+
|
| 106 |
+
| ํ์ผ | ์ค๋ช
|
|
| 107 |
+
|------|------|
|
| 108 |
+
| `checkpoints/korean_3b_fp8_run1/checkpoint-0057000/` | 3B Base ๋ชจ๋ธ (Phase 1 ์ต์ข
) |
|
| 109 |
+
| `checkpoints/korean_3b_sft_v1/checkpoint-best/` | **3B SFT ๋ชจ๋ธ (Phase 2 ์ต์ข
)** |
|
| 110 |
+
| `configs/korean_3b_orpo.yaml` | ORPO ์ค์ |
|
| 111 |
+
| `data/preference/combined_preference.jsonl` | ORPO ํ์ต ๋ฐ์ดํฐ (795K pairs) |
|
| 112 |
+
| `reports/2026-03-06_3B_SFT_COMPLETION_AND_EVAL_SUMMARY.md` | SFT ์๋ฃ + ํ๊ฐ ์์ฝ |
|
| 113 |
+
| `reports/2026-03-06_3B_SFT_EVALUATION_REPORT.md` | SFT 6์ฐจ์ ํ๊ฐ ์์ธ |
|
| 114 |
+
|
| 115 |
+
---
|
| 116 |
+
|
| 117 |
+
## ํ์๋ผ์ธ
|
| 118 |
+
|
| 119 |
+
```
|
| 120 |
+
Feb 25 Phase 0 ์์ (๊ธฐ๋ฐ ๊ตฌ์ถ, 125M FP8 ๊ฒ์ฆ)
|
| 121 |
+
Feb 25-26 1B Pretrain (34K steps, loss 1.904)
|
| 122 |
+
Feb 26 1B SFT v1 ์คํจ (label off-by-one)
|
| 123 |
+
Feb 27 1B SFT v2 ์ฑ๊ณต (val_loss 2.206, ๋ฐ๋ณต๋ฅ 18%)
|
| 124 |
+
Feb 27 ์ ์คํฐ์ค๋ฆฌ๊ทธ ํ ๋ก โ 3B ์ ํ ๊ฒฐ์
|
| 125 |
+
Feb 27 640GB+ ๋ฐ์ดํฐ ์กฐ๋ฆฝ
|
| 126 |
+
Mar 02 Phase 0 ์๋ฃ (GQA FA, DDP, NCCL ์ต์ ํ)
|
| 127 |
+
Mar 02 Phase 1 ์์ (3B Pretrain)
|
| 128 |
+
Mar 05 Phase 1 ์๋ฃ (57K steps, loss 1.466, 63์๊ฐ)
|
| 129 |
+
Mar 05 Phase 2 ์์ (SFT, 2.44M samples)
|
| 130 |
+
Mar 06 Phase 2 ์๋ฃ (25.5K steps, val_loss 1.8851, early stopping)
|
| 131 |
+
Mar 06 SFT 6์ฐจ์ ํ๊ฐ ์๋ฃ (4/6 PASS)
|
| 132 |
+
Mar 06 โ ORPO ์งํ ๊ฒฐ์ (Phase 3 ์ค๋น ์๋ฃ)
|
| 133 |
+
```
|