intrect commited on
Commit
d35ad96
ยท
verified ยท
1 Parent(s): ecb3e2c

docs: update training data distribution with accurate numbers (SFT 36,713 + DPO 24,779)

Browse files
Files changed (1) hide show
  1. README.md +48 -19
README.md CHANGED
@@ -21,15 +21,17 @@ pipeline_tag: text-generation
21
  **ํ•œ๊ตญ ์ฃผ์‹์‹œ์žฅ ์ „๋ฌธ AI ์• ๋„๋ฆฌ์ŠคํŠธ**
22
 
23
  VELA๋Š” ํ•œ๊ตญ ์ฃผ์‹์‹œ์žฅ ๋‰ด์Šค ๋ถ„์„ ๋ฐ ํˆฌ์ž ๋ฆฌ์„œ์น˜๋ฅผ ์œ„ํ•ด ํŠนํ™”๋œ 7B ํŒŒ๋ผ๋ฏธํ„ฐ ์–ธ์–ด ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.
 
24
 
25
  ## Model Details
26
 
27
  | ํ•ญ๋ชฉ | ๋‚ด์šฉ |
28
  |------|------|
29
  | **Base Model** | Qwen/Qwen2.5-7B-Instruct |
30
- | **Training** | SFT (930K) + DPO (7,681 pairs) |
31
  | **Parameters** | 7.6B |
32
  | **Context Length** | 8,192 tokens |
 
33
  | **License** | Apache 2.0 |
34
 
35
  ### Available Formats
@@ -45,22 +47,58 @@ VELA๋Š” ํ•œ๊ตญ ์ฃผ์‹์‹œ์žฅ ๋‰ด์Šค ๋ถ„์„ ๋ฐ ํˆฌ์ž ๋ฆฌ์„œ์น˜๋ฅผ ์œ„ํ•ด ํŠนํ™”
45
  ```
46
  Qwen2.5-7B-Instruct
47
  โ†“
48
- SFT (930K samples)
49
- - ํ•œ๊ตญ ์ฃผ์‹ ๋‰ด์Šค ๋ถ„์„ (412K)
50
- - ๋ฆฌ์„œ์น˜ ๋ฆฌํฌํŠธ ์ƒ์„ฑ (50K)
51
- - Reasoning Trace ํ•™์Šต (5K)
 
 
 
52
  โ†“
53
- DPO (7,681 pairs)
54
- - ์ค‘๊ตญ์–ด/์˜์–ด leak ๊ต์ •
55
- - ํ•œ๊ตญ์–ด ์ถœ๋ ฅ ๊ฐ•ํ™”
56
- - ํ˜•์‹ ์ค€์ˆ˜ ํ–ฅ์ƒ
 
 
57
  โ†“
58
  VELA
59
  ```
60
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
61
  ## Capabilities
62
 
63
  - **๋‰ด์Šค ์˜ํ–ฅ ๋ถ„์„**: ์ฃผ์‹ ๊ด€๋ จ ๋‰ด์Šค์˜ ์‹œ์žฅ ์˜ํ–ฅ๋„ ์˜ˆ์ธก
 
64
  - **๋ฆฌ์„œ์น˜ ๋ฆฌํฌํŠธ ์ƒ์„ฑ**: ๊ตฌ์กฐํ™”๋œ ํˆฌ์ž ๋ถ„์„ ๋ณด๊ณ ์„œ (7๊ฐœ ์„น์…˜)
65
  - **Reasoning Trace**: ๋‹จ๊ณ„๋ณ„ ๋ถ„์„ ์‚ฌ๊ณ ๊ณผ์ • (JSON ํ˜•์‹)
66
  - **๋‹ค์ค‘ ์†Œ์Šค ์ข…ํ•ฉ**: ๋‰ด์Šค, ์‹œ์„ธ, ์ˆ˜๊ธ‰ ๋ฐ์ดํ„ฐ ํ†ตํ•ฉ ๋ถ„์„
@@ -193,15 +231,6 @@ VELA๋Š” ๋‘ ๊ฐ€์ง€ ์ถœ๋ ฅ ๋ชจ๋“œ๋ฅผ ์ง€์›ํ•ฉ๋‹ˆ๋‹ค:
193
  ## ํˆฌ์ž ์˜๊ฒฌ
194
  ```
195
 
196
- ## Training Data
197
-
198
- | ๋ฐ์ดํ„ฐ์…‹ | ์ƒ˜ํ”Œ ์ˆ˜ | ์šฉ๋„ |
199
- |----------|---------|------|
200
- | ํ•œ๊ตญ ์ฃผ์‹ ๋‰ด์Šค | 412K | SFT ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ |
201
- | ๋ฆฌ์„œ์น˜ ๋ฆฌํฌํŠธ | 50K | ๋ถ„์„ ํ˜•์‹ ํ•™์Šต |
202
- | Reasoning Traces | 5K | ์‚ฌ๊ณ ๊ณผ์ • ํ•™์Šต |
203
- | DPO Pairs | 7.7K | ์„ ํ˜ธ๋„ ์ •๋ ฌ |
204
-
205
  ## DPO Improvements
206
 
207
  - โœ… **์ค‘๊ตญ์–ด leak ์ œ๊ฑฐ**: Stress test 10/10 CLEAN
@@ -232,7 +261,7 @@ VELA๋Š” ๋‘ ๊ฐ€์ง€ ์ถœ๋ ฅ ๋ชจ๋“œ๋ฅผ ์ง€์›ํ•ฉ๋‹ˆ๋‹ค:
232
 
233
  | ๋ฒ„์ „ | ๋‚ ์งœ | ๋ณ€๊ฒฝ์‚ฌํ•ญ |
234
  |------|------|----------|
235
- | v1.1 | 2026-02-12 | GGUF ์–‘์žํ™” ๋ชจ๋ธ ์ถ”๊ฐ€ (Q4_K_M, Q8_0), ๋ฒค์น˜๋งˆํฌ |
236
  | v1.0 | 2026-01-28 | DPO ๋ณ‘ํ•ฉ, ์ค‘๊ตญ์–ด/์˜์–ด leak ํ•ด๊ฒฐ |
237
  | v0.9 | 2026-01-15 | SFT ๋ฒ ์ด์Šค ๋ชจ๋ธ ๊ณต๊ฐœ |
238
 
 
21
  **ํ•œ๊ตญ ์ฃผ์‹์‹œ์žฅ ์ „๋ฌธ AI ์• ๋„๋ฆฌ์ŠคํŠธ**
22
 
23
  VELA๋Š” ํ•œ๊ตญ ์ฃผ์‹์‹œ์žฅ ๋‰ด์Šค ๋ถ„์„ ๋ฐ ํˆฌ์ž ๋ฆฌ์„œ์น˜๋ฅผ ์œ„ํ•ด ํŠนํ™”๋œ 7B ํŒŒ๋ผ๋ฏธํ„ฐ ์–ธ์–ด ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.
24
+ 2,135๊ฐœ ์ข…๋ชฉ์— ๋Œ€ํ•œ ๋‰ด์Šค ์˜ํ–ฅ ๋ถ„์„, ์ฆ๊ถŒ์‚ฌ ๋ฆฌํฌํŠธ ํ•ด์„, Reasoning Trace ๊ธฐ๋ฐ˜ ๊ตฌ์กฐํ™”๋œ ํˆฌ์ž ๋ถ„์„์„ ์ˆ˜ํ–‰ํ•ฉ๋‹ˆ๋‹ค.
25
 
26
  ## Model Details
27
 
28
  | ํ•ญ๋ชฉ | ๋‚ด์šฉ |
29
  |------|------|
30
  | **Base Model** | Qwen/Qwen2.5-7B-Instruct |
31
+ | **Training** | SFT (36,713) + DPO (24,779 pairs) |
32
  | **Parameters** | 7.6B |
33
  | **Context Length** | 8,192 tokens |
34
+ | **Stock Coverage** | 2,135 ์ข…๋ชฉ (KOSPI + KOSDAQ) |
35
  | **License** | Apache 2.0 |
36
 
37
  ### Available Formats
 
47
  ```
48
  Qwen2.5-7B-Instruct
49
  โ†“
50
+ SFT (36,713 samples)
51
+ - ๋‰ด์Šค ๋ถ„๋ฅ˜ ๋ถ„์„ 10,830
52
+ - ๊ทน๋‹จ ์‹œ๊ทธ๋„ ๋ถ„์„ 9,603
53
+ - ์ฆ๊ถŒ์‚ฌ ๋ฆฌํฌํŠธ 5,117
54
+ - ๋‰ด์Šค ์˜ํ–ฅ ๋ถ„์„ 4,839
55
+ - Tool Calling 1,965
56
+ - ๊ธฐํƒ€ (๋น„๊ต๋ถ„์„, ์‹ค์ , ๋ฆฌ์Šคํฌ, ์ˆ˜๊ธ‰, ์„นํ„ฐ, ๋งคํฌ๋กœ) 4,359
57
  โ†“
58
+ DPO (24,779 pairs)
59
+ - ์ค‘๋ณต ์ œ๊ฑฐ ๊ธฐ๋ณธ ํŽ˜์–ด 12,000
60
+ - ๋‹ค๊ตญ์–ด leak ๋ณด๊ฐ• 5,997
61
+ - VELA ChatML ์ •๋ ฌ 5,000
62
+ - ์ค‘๊ตญ์–ด leak ๊ต์ • v2 1,216
63
+ - Reasoning Trace ์ •๋ ฌ 566
64
  โ†“
65
  VELA
66
  ```
67
 
68
+ ## Training Data Distribution
69
+
70
+ ### SFT (36,713 samples, 2,135 ์ข…๋ชฉ)
71
+
72
+ | Source | Samples | Ratio | Description |
73
+ |--------|---------|-------|-------------|
74
+ | **classified_news** | 10,830 | 29.5% | GPT-4o ๋ถ„๋ฅ˜๋œ ๋‰ด์Šค โ†’ Reasoning Trace ์ƒ์„ฑ |
75
+ | **extreme_signals** | 9,603 | 26.2% | ๊ธ‰๋“ฑ/๊ธ‰๋ฝ ์‹œ๊ทธ๋„ ๋‰ด์Šค ๋ถ„์„ |
76
+ | **securities_report_gpt4o** | 5,117 | 13.9% | ์ฆ๊ถŒ์‚ฌ ๋ฆฌํฌํŠธ GPT-4o ์žฌ๊ตฌ์„ฑ (๋„ค์ด๋ฒ„ ์ข…๋ชฉ๋ถ„์„ + ๋ฏธ๋ž˜์—์…‹) |
77
+ | **analysis_news** | 4,839 | 13.2% | ์ผ๋ฐ˜ ๋‰ด์Šค ์˜ํ–ฅ ๋ถ„์„ |
78
+ | **tool_calling** | 1,965 | 5.4% | Search/Price/Investor ๋„๊ตฌ ํ˜ธ์ถœ ํ•™์Šต |
79
+ | **multi_stock_comparison** | 981 | 2.7% | ๋‹ค์ค‘ ์ข…๋ชฉ ๋น„๊ต ๋ถ„์„ |
80
+ | **earnings_impact** | 971 | 2.6% | ์‹ค์  ๋ฐœํ‘œ ์˜ํ–ฅ ๋ถ„์„ |
81
+ | **risk_alert** | 948 | 2.6% | ๋ฆฌ์Šคํฌ ๊ฒฝ๋ณด ๋ถ„์„ |
82
+ | **supply_demand** | 492 | 1.3% | ์ˆ˜๊ธ‰ ๋™ํ–ฅ ๋ถ„์„ |
83
+ | **sector_theme** | 486 | 1.3% | ์„นํ„ฐ/ํ…Œ๋งˆ ๋ถ„์„ |
84
+ | **macro_impact** | 481 | 1.3% | ๋งคํฌ๋กœ ์ง€ํ‘œ ์˜ํ–ฅ ๋ถ„์„ |
85
+
86
+ > ํ‰๊ท  ์‘๋‹ต ๊ธธ์ด: 2,337์ž (Reasoning Trace JSON + ๋ถ„์„ ๋ฆฌํฌํŠธ ํฌํ•จ)
87
+
88
+ ### DPO (24,779 pairs)
89
+
90
+ | Source | Pairs | Ratio | Description |
91
+ |--------|-------|-------|-------------|
92
+ | **dpo_dedup** | 12,000 | 48.4% | ์ค‘๋ณต ์ œ๊ฑฐ๋œ ๊ธฐ๋ณธ DPO ํŽ˜์–ด |
93
+ | **multilingual_aug** | 5,997 | 24.2% | ์ค‘๊ตญ์–ด/์˜์–ด leak ๋ณด๊ฐ• (rejected์— leak ์‚ฝ์ž…) |
94
+ | **vela_chatml** | 5,000 | 20.2% | VELA ์‹œ์Šคํ…œ ํ”„๋กฌํ”„ํŠธ ์ •๋ ฌ |
95
+ | **chinese_leak_v2** | 1,216 | 4.9% | ์ค‘๊ตญ์–ด leak ์ง‘์ค‘ ๊ต์ • |
96
+ | **reasoning_trace_2k** | 566 | 2.3% | Reasoning Trace ํ˜•์‹ ์ •๋ ฌ |
97
+
98
  ## Capabilities
99
 
100
  - **๋‰ด์Šค ์˜ํ–ฅ ๋ถ„์„**: ์ฃผ์‹ ๊ด€๋ จ ๋‰ด์Šค์˜ ์‹œ์žฅ ์˜ํ–ฅ๋„ ์˜ˆ์ธก
101
+ - **์ฆ๊ถŒ์‚ฌ ๋ฆฌํฌํŠธ ํ•ด์„**: ์• ๋„๋ฆฌ์ŠคํŠธ ๋ฆฌํฌํŠธ ๊ธฐ๋ฐ˜ ํˆฌ์ž ๋ถ„์„
102
  - **๋ฆฌ์„œ์น˜ ๋ฆฌํฌํŠธ ์ƒ์„ฑ**: ๊ตฌ์กฐํ™”๋œ ํˆฌ์ž ๋ถ„์„ ๋ณด๊ณ ์„œ (7๊ฐœ ์„น์…˜)
103
  - **Reasoning Trace**: ๋‹จ๊ณ„๋ณ„ ๋ถ„์„ ์‚ฌ๊ณ ๊ณผ์ • (JSON ํ˜•์‹)
104
  - **๋‹ค์ค‘ ์†Œ์Šค ์ข…ํ•ฉ**: ๋‰ด์Šค, ์‹œ์„ธ, ์ˆ˜๊ธ‰ ๋ฐ์ดํ„ฐ ํ†ตํ•ฉ ๋ถ„์„
 
231
  ## ํˆฌ์ž ์˜๊ฒฌ
232
  ```
233
 
 
 
 
 
 
 
 
 
 
234
  ## DPO Improvements
235
 
236
  - โœ… **์ค‘๊ตญ์–ด leak ์ œ๊ฑฐ**: Stress test 10/10 CLEAN
 
261
 
262
  | ๋ฒ„์ „ | ๋‚ ์งœ | ๋ณ€๊ฒฝ์‚ฌํ•ญ |
263
  |------|------|----------|
264
+ | v1.1 | 2026-02-12 | GGUF ์–‘์žํ™” ๋ชจ๋ธ ์ถ”๊ฐ€ (Q4_K_M, Q8_0), ๋ฒค์น˜๋งˆํฌ, ํ•™์Šต ๋ฐ์ดํ„ฐ ๋ถ„ํฌ ๊ณต๊ฐœ |
265
  | v1.0 | 2026-01-28 | DPO ๋ณ‘ํ•ฉ, ์ค‘๊ตญ์–ด/์˜์–ด leak ํ•ด๊ฒฐ |
266
  | v0.9 | 2026-01-15 | SFT ๋ฒ ์ด์Šค ๋ชจ๋ธ ๊ณต๊ฐœ |
267