docs: update training data distribution with accurate numbers (SFT 36,713 + DPO 24,779)
Browse files
README.md
CHANGED
|
@@ -21,15 +21,17 @@ pipeline_tag: text-generation
|
|
| 21 |
**ํ๊ตญ ์ฃผ์์์ฅ ์ ๋ฌธ AI ์ ๋๋ฆฌ์คํธ**
|
| 22 |
|
| 23 |
VELA๋ ํ๊ตญ ์ฃผ์์์ฅ ๋ด์ค ๋ถ์ ๋ฐ ํฌ์ ๋ฆฌ์์น๋ฅผ ์ํด ํนํ๋ 7B ํ๋ผ๋ฏธํฐ ์ธ์ด ๋ชจ๋ธ์
๋๋ค.
|
|
|
|
| 24 |
|
| 25 |
## Model Details
|
| 26 |
|
| 27 |
| ํญ๋ชฉ | ๋ด์ฉ |
|
| 28 |
|------|------|
|
| 29 |
| **Base Model** | Qwen/Qwen2.5-7B-Instruct |
|
| 30 |
-
| **Training** | SFT (
|
| 31 |
| **Parameters** | 7.6B |
|
| 32 |
| **Context Length** | 8,192 tokens |
|
|
|
|
| 33 |
| **License** | Apache 2.0 |
|
| 34 |
|
| 35 |
### Available Formats
|
|
@@ -45,22 +47,58 @@ VELA๋ ํ๊ตญ ์ฃผ์์์ฅ ๋ด์ค ๋ถ์ ๋ฐ ํฌ์ ๋ฆฌ์์น๋ฅผ ์ํด ํนํ
|
|
| 45 |
```
|
| 46 |
Qwen2.5-7B-Instruct
|
| 47 |
โ
|
| 48 |
-
SFT (
|
| 49 |
-
-
|
| 50 |
-
-
|
| 51 |
-
-
|
|
|
|
|
|
|
|
|
|
| 52 |
โ
|
| 53 |
-
DPO (
|
| 54 |
-
-
|
| 55 |
-
-
|
| 56 |
-
-
|
|
|
|
|
|
|
| 57 |
โ
|
| 58 |
VELA
|
| 59 |
```
|
| 60 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 61 |
## Capabilities
|
| 62 |
|
| 63 |
- **๋ด์ค ์ํฅ ๋ถ์**: ์ฃผ์ ๊ด๋ จ ๋ด์ค์ ์์ฅ ์ํฅ๋ ์์ธก
|
|
|
|
| 64 |
- **๋ฆฌ์์น ๋ฆฌํฌํธ ์์ฑ**: ๊ตฌ์กฐํ๋ ํฌ์ ๋ถ์ ๋ณด๊ณ ์ (7๊ฐ ์น์
)
|
| 65 |
- **Reasoning Trace**: ๋จ๊ณ๋ณ ๋ถ์ ์ฌ๊ณ ๊ณผ์ (JSON ํ์)
|
| 66 |
- **๋ค์ค ์์ค ์ข
ํฉ**: ๋ด์ค, ์์ธ, ์๊ธ ๋ฐ์ดํฐ ํตํฉ ๋ถ์
|
|
@@ -193,15 +231,6 @@ VELA๋ ๋ ๊ฐ์ง ์ถ๋ ฅ ๋ชจ๋๋ฅผ ์ง์ํฉ๋๋ค:
|
|
| 193 |
## ํฌ์ ์๊ฒฌ
|
| 194 |
```
|
| 195 |
|
| 196 |
-
## Training Data
|
| 197 |
-
|
| 198 |
-
| ๋ฐ์ดํฐ์
| ์ํ ์ | ์ฉ๋ |
|
| 199 |
-
|----------|---------|------|
|
| 200 |
-
| ํ๊ตญ ์ฃผ์ ๋ด์ค | 412K | SFT ๊ธฐ๋ฐ ๋ฐ์ดํฐ |
|
| 201 |
-
| ๋ฆฌ์์น ๋ฆฌํฌํธ | 50K | ๋ถ์ ํ์ ํ์ต |
|
| 202 |
-
| Reasoning Traces | 5K | ์ฌ๊ณ ๊ณผ์ ํ์ต |
|
| 203 |
-
| DPO Pairs | 7.7K | ์ ํธ๋ ์ ๋ ฌ |
|
| 204 |
-
|
| 205 |
## DPO Improvements
|
| 206 |
|
| 207 |
- โ
**์ค๊ตญ์ด leak ์ ๊ฑฐ**: Stress test 10/10 CLEAN
|
|
@@ -232,7 +261,7 @@ VELA๋ ๋ ๊ฐ์ง ์ถ๋ ฅ ๋ชจ๋๋ฅผ ์ง์ํฉ๋๋ค:
|
|
| 232 |
|
| 233 |
| ๋ฒ์ | ๋ ์ง | ๋ณ๊ฒฝ์ฌํญ |
|
| 234 |
|------|------|----------|
|
| 235 |
-
| v1.1 | 2026-02-12 | GGUF ์์ํ ๋ชจ๋ธ ์ถ๊ฐ (Q4_K_M, Q8_0),
|
| 236 |
| v1.0 | 2026-01-28 | DPO ๋ณํฉ, ์ค๊ตญ์ด/์์ด leak ํด๊ฒฐ |
|
| 237 |
| v0.9 | 2026-01-15 | SFT ๋ฒ ์ด์ค ๋ชจ๋ธ ๊ณต๊ฐ |
|
| 238 |
|
|
|
|
| 21 |
**ํ๊ตญ ์ฃผ์์์ฅ ์ ๋ฌธ AI ์ ๋๋ฆฌ์คํธ**
|
| 22 |
|
| 23 |
VELA๋ ํ๊ตญ ์ฃผ์์์ฅ ๋ด์ค ๋ถ์ ๋ฐ ํฌ์ ๋ฆฌ์์น๋ฅผ ์ํด ํนํ๋ 7B ํ๋ผ๋ฏธํฐ ์ธ์ด ๋ชจ๋ธ์
๋๋ค.
|
| 24 |
+
2,135๊ฐ ์ข
๋ชฉ์ ๋ํ ๋ด์ค ์ํฅ ๋ถ์, ์ฆ๊ถ์ฌ ๋ฆฌํฌํธ ํด์, Reasoning Trace ๊ธฐ๋ฐ ๊ตฌ์กฐํ๋ ํฌ์ ๋ถ์์ ์ํํฉ๋๋ค.
|
| 25 |
|
| 26 |
## Model Details
|
| 27 |
|
| 28 |
| ํญ๋ชฉ | ๋ด์ฉ |
|
| 29 |
|------|------|
|
| 30 |
| **Base Model** | Qwen/Qwen2.5-7B-Instruct |
|
| 31 |
+
| **Training** | SFT (36,713) + DPO (24,779 pairs) |
|
| 32 |
| **Parameters** | 7.6B |
|
| 33 |
| **Context Length** | 8,192 tokens |
|
| 34 |
+
| **Stock Coverage** | 2,135 ์ข
๋ชฉ (KOSPI + KOSDAQ) |
|
| 35 |
| **License** | Apache 2.0 |
|
| 36 |
|
| 37 |
### Available Formats
|
|
|
|
| 47 |
```
|
| 48 |
Qwen2.5-7B-Instruct
|
| 49 |
โ
|
| 50 |
+
SFT (36,713 samples)
|
| 51 |
+
- ๋ด์ค ๋ถ๋ฅ ๋ถ์ 10,830
|
| 52 |
+
- ๊ทน๋จ ์๊ทธ๋ ๋ถ์ 9,603
|
| 53 |
+
- ์ฆ๊ถ์ฌ ๋ฆฌํฌํธ 5,117
|
| 54 |
+
- ๋ด์ค ์ํฅ ๋ถ์ 4,839
|
| 55 |
+
- Tool Calling 1,965
|
| 56 |
+
- ๊ธฐํ (๋น๊ต๋ถ์, ์ค์ , ๋ฆฌ์คํฌ, ์๊ธ, ์นํฐ, ๋งคํฌ๋ก) 4,359
|
| 57 |
โ
|
| 58 |
+
DPO (24,779 pairs)
|
| 59 |
+
- ์ค๋ณต ์ ๊ฑฐ ๊ธฐ๋ณธ ํ์ด 12,000
|
| 60 |
+
- ๋ค๊ตญ์ด leak ๋ณด๊ฐ 5,997
|
| 61 |
+
- VELA ChatML ์ ๋ ฌ 5,000
|
| 62 |
+
- ์ค๊ตญ์ด leak ๊ต์ v2 1,216
|
| 63 |
+
- Reasoning Trace ์ ๋ ฌ 566
|
| 64 |
โ
|
| 65 |
VELA
|
| 66 |
```
|
| 67 |
|
| 68 |
+
## Training Data Distribution
|
| 69 |
+
|
| 70 |
+
### SFT (36,713 samples, 2,135 ์ข
๋ชฉ)
|
| 71 |
+
|
| 72 |
+
| Source | Samples | Ratio | Description |
|
| 73 |
+
|--------|---------|-------|-------------|
|
| 74 |
+
| **classified_news** | 10,830 | 29.5% | GPT-4o ๋ถ๋ฅ๋ ๋ด์ค โ Reasoning Trace ์์ฑ |
|
| 75 |
+
| **extreme_signals** | 9,603 | 26.2% | ๊ธ๋ฑ/๊ธ๋ฝ ์๊ทธ๋ ๋ด์ค ๋ถ์ |
|
| 76 |
+
| **securities_report_gpt4o** | 5,117 | 13.9% | ์ฆ๊ถ์ฌ ๋ฆฌํฌํธ GPT-4o ์ฌ๊ตฌ์ฑ (๋ค์ด๋ฒ ์ข
๋ชฉ๋ถ์ + ๋ฏธ๋์์
) |
|
| 77 |
+
| **analysis_news** | 4,839 | 13.2% | ์ผ๋ฐ ๋ด์ค ์ํฅ ๋ถ์ |
|
| 78 |
+
| **tool_calling** | 1,965 | 5.4% | Search/Price/Investor ๋๊ตฌ ํธ์ถ ํ์ต |
|
| 79 |
+
| **multi_stock_comparison** | 981 | 2.7% | ๋ค์ค ์ข
๋ชฉ ๋น๊ต ๋ถ์ |
|
| 80 |
+
| **earnings_impact** | 971 | 2.6% | ์ค์ ๋ฐํ ์ํฅ ๋ถ์ |
|
| 81 |
+
| **risk_alert** | 948 | 2.6% | ๋ฆฌ์คํฌ ๊ฒฝ๋ณด ๋ถ์ |
|
| 82 |
+
| **supply_demand** | 492 | 1.3% | ์๊ธ ๋ํฅ ๋ถ์ |
|
| 83 |
+
| **sector_theme** | 486 | 1.3% | ์นํฐ/ํ
๋ง ๋ถ์ |
|
| 84 |
+
| **macro_impact** | 481 | 1.3% | ๋งคํฌ๋ก ์งํ ์ํฅ ๋ถ์ |
|
| 85 |
+
|
| 86 |
+
> ํ๊ท ์๋ต ๊ธธ์ด: 2,337์ (Reasoning Trace JSON + ๋ถ์ ๋ฆฌํฌํธ ํฌํจ)
|
| 87 |
+
|
| 88 |
+
### DPO (24,779 pairs)
|
| 89 |
+
|
| 90 |
+
| Source | Pairs | Ratio | Description |
|
| 91 |
+
|--------|-------|-------|-------------|
|
| 92 |
+
| **dpo_dedup** | 12,000 | 48.4% | ์ค๋ณต ์ ๊ฑฐ๋ ๊ธฐ๋ณธ DPO ํ์ด |
|
| 93 |
+
| **multilingual_aug** | 5,997 | 24.2% | ์ค๊ตญ์ด/์์ด leak ๋ณด๊ฐ (rejected์ leak ์ฝ์
) |
|
| 94 |
+
| **vela_chatml** | 5,000 | 20.2% | VELA ์์คํ
ํ๋กฌํํธ ์ ๋ ฌ |
|
| 95 |
+
| **chinese_leak_v2** | 1,216 | 4.9% | ์ค๊ตญ์ด leak ์ง์ค ๊ต์ |
|
| 96 |
+
| **reasoning_trace_2k** | 566 | 2.3% | Reasoning Trace ํ์ ์ ๋ ฌ |
|
| 97 |
+
|
| 98 |
## Capabilities
|
| 99 |
|
| 100 |
- **๋ด์ค ์ํฅ ๋ถ์**: ์ฃผ์ ๊ด๋ จ ๋ด์ค์ ์์ฅ ์ํฅ๋ ์์ธก
|
| 101 |
+
- **์ฆ๊ถ์ฌ ๋ฆฌํฌํธ ํด์**: ์ ๋๋ฆฌ์คํธ ๋ฆฌํฌํธ ๊ธฐ๋ฐ ํฌ์ ๋ถ์
|
| 102 |
- **๋ฆฌ์์น ๋ฆฌํฌํธ ์์ฑ**: ๊ตฌ์กฐํ๋ ํฌ์ ๋ถ์ ๋ณด๊ณ ์ (7๊ฐ ์น์
)
|
| 103 |
- **Reasoning Trace**: ๋จ๊ณ๋ณ ๋ถ์ ์ฌ๊ณ ๊ณผ์ (JSON ํ์)
|
| 104 |
- **๋ค์ค ์์ค ์ข
ํฉ**: ๋ด์ค, ์์ธ, ์๊ธ ๋ฐ์ดํฐ ํตํฉ ๋ถ์
|
|
|
|
| 231 |
## ํฌ์ ์๊ฒฌ
|
| 232 |
```
|
| 233 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 234 |
## DPO Improvements
|
| 235 |
|
| 236 |
- โ
**์ค๊ตญ์ด leak ์ ๊ฑฐ**: Stress test 10/10 CLEAN
|
|
|
|
| 261 |
|
| 262 |
| ๋ฒ์ | ๋ ์ง | ๋ณ๊ฒฝ์ฌํญ |
|
| 263 |
|------|------|----------|
|
| 264 |
+
| v1.1 | 2026-02-12 | GGUF ์์ํ ๋ชจ๋ธ ์ถ๊ฐ (Q4_K_M, Q8_0), ๋ฒค์น๋งํฌ, ํ์ต ๋ฐ์ดํฐ ๋ถํฌ ๊ณต๊ฐ |
|
| 265 |
| v1.0 | 2026-01-28 | DPO ๋ณํฉ, ์ค๊ตญ์ด/์์ด leak ํด๊ฒฐ |
|
| 266 |
| v0.9 | 2026-01-15 | SFT ๋ฒ ์ด์ค ๋ชจ๋ธ ๊ณต๊ฐ |
|
| 267 |
|