77ethers commited on
Commit
c1f9ee3
Β·
verified Β·
1 Parent(s): fa5c430

v6_sft_only_v2: training log

Browse files
Files changed (1) hide show
  1. v6_sft_only_v2/training.log +615 -0
v6_sft_only_v2/training.log ADDED
@@ -0,0 +1,615 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ πŸ¦₯ Unsloth: Will patch your computer to enable 2x faster free finetuning.
2
+ πŸ¦₯ Unsloth Zoo will now patch everything to make training faster!
3
+ Loading unsloth/Qwen3-4B-Instruct-2507...
4
+ ==((====))== Unsloth 2026.4.8: Fast Qwen3 patching. Transformers: 4.56.2. vLLM: 0.15.1.
5
+ \\ /| NVIDIA L40S. Num GPUs = 1. Max memory: 44.392 GB. Platform: Linux.
6
+ O^O/ \_/ \ Torch: 2.9.1+cu128. CUDA: 8.9. CUDA Toolkit: 12.8. Triton: 3.5.1
7
+ \ / Bfloat16 = TRUE. FA [Xformers = 0.0.33.post2. FA2 = False]
8
+ "-____-" Free license: http://github.com/unslothai/unsloth
9
+ Unsloth: Fast downloading is enabled - ignore downloading bars which are red colored!
10
+
11
+
12
+ model.safetensors.index.json: 0%| | 0.00/32.9k [00:00<?, ?B/s]
13
+ model.safetensors.index.json: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 32.9k/32.9k [00:00<00:00, 135MB/s]
14
+
15
+
16
+ model-00001-of-00002.safetensors: 0%| | 0.00/4.97G [00:00<?, ?B/s]
17
+
18
+ model-00001-of-00002.safetensors: 1%|▏ | 67.1M/4.97G [00:01<01:19, 61.6MB/s]
19
+
20
+ model-00001-of-00002.safetensors: 11%|β–ˆ | 536M/4.97G [00:02<00:17, 259MB/s] 
21
+
22
+ model-00001-of-00002.safetensors: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 3.08G/4.97G [00:03<00:01, 1.08GB/s]
23
+ model-00001-of-00002.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.97G/4.97G [00:03<00:00, 1.29GB/s]
24
+
25
+
26
+ model-00002-of-00002.safetensors: 0%| | 0.00/3.08G [00:00<?, ?B/s]
27
+
28
+ model-00002-of-00002.safetensors: 0%| | 0.00/3.08G [00:01<?, ?B/s]
29
+
30
+ model-00002-of-00002.safetensors: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 1.54G/3.08G [00:02<00:01, 1.30GB/s]
31
+ model-00002-of-00002.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 3.08G/3.08G [00:03<00:00, 1.02GB/s]
32
+
33
+
34
+ Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s]
35
+ Loading checkpoint shards: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2/2 [00:00<00:00, 2.63it/s]
36
+
37
+
38
+ generation_config.json: 0%| | 0.00/237 [00:00<?, ?B/s]
39
+ generation_config.json: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 237/237 [00:00<00:00, 2.27MB/s]
40
+
41
+
42
+ tokenizer_config.json: 0%| | 0.00/9.65k [00:00<?, ?B/s]
43
+ tokenizer_config.json: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 9.65k/9.65k [00:00<00:00, 61.1MB/s]
44
+
45
+
46
+ vocab.json: 0%| | 0.00/2.78M [00:00<?, ?B/s]
47
+ vocab.json: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2.78M/2.78M [00:00<00:00, 40.9MB/s]
48
+
49
+
50
+ merges.txt: 0%| | 0.00/1.67M [00:00<?, ?B/s]
51
+ merges.txt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.67M/1.67M [00:00<00:00, 128MB/s]
52
+
53
+
54
+ added_tokens.json: 0%| | 0.00/707 [00:00<?, ?B/s]
55
+ added_tokens.json: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 707/707 [00:00<00:00, 6.15MB/s]
56
+
57
+
58
+ special_tokens_map.json: 0%| | 0.00/614 [00:00<?, ?B/s]
59
+ special_tokens_map.json: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 614/614 [00:00<00:00, 4.61MB/s]
60
+
61
+
62
+ tokenizer.json: 0%| | 0.00/11.4M [00:00<?, ?B/s]
63
+ tokenizer.json: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 11.4M/11.4M [00:00<00:00, 54.2MB/s]
64
+
65
+
66
+ chat_template.jinja: 0%| | 0.00/4.04k [00:00<?, ?B/s]
67
+ chat_template.jinja: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.04k/4.04k [00:00<00:00, 27.2MB/s]
68
+ unsloth/Qwen3-4B-Instruct-2507 does not have a padding token! Will use pad_token = <|PAD_TOKEN|>.
69
+ Unsloth 2026.4.8 patched 36 layers with 36 QKV layers, 36 O layers and 36 MLP layers.
70
+ VRAM allocated: 8.20 GB
71
+
72
+ ══ SFT warm-start β€” sft_traces/merged_v6_aligned.jsonl ══
73
+ 200 SFT examples loaded (chat format in `text`)
74
+
75
+
76
+ Unsloth: Tokenizing ["text"] (num_proc=12): 0%| | 0/200 [00:00<?, ? examples/s]
77
+
78
+ Unsloth: Tokenizing ["text"] (num_proc=12): 26%|β–ˆβ–ˆβ–Œ | 51/200 [00:01<00:03, 44.59 examples/s]
79
+ Unsloth: Tokenizing ["text"] (num_proc=12): 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 200/200 [00:02<00:00, 95.33 examples/s]
80
+ πŸ¦₯ Unsloth: Padding-free auto-enabled, enabling faster training.
81
+ ==((====))== Unsloth - 2x faster free finetuning | Num GPUs used = 1
82
+ \\ /| Num examples = 200 | Num Epochs = 6 | Total steps = 150
83
+ O^O/ \_/ \ Batch size per device = 2 | Gradient accumulation steps = 4
84
+ \ / Data Parallel GPUs = 1 | Total batch size (2 x 4 x 1) = 8
85
+ "-____-" Trainable parameters = 33,030,144 of 4,055,498,240 (0.81% trained)
86
+
87
+
88
+ 0%| | 0/150 [00:00<?, ?it/s]Unsloth: Will smartly offload gradients to save VRAM!
89
+
90
+
91
+ 1%| | 1/150 [00:06<17:00, 6.85s/it]
92
+
93
+ 1%|▏ | 2/150 [00:08<09:12, 3.73s/it]
94
+
95
+ 2%|▏ | 3/150 [00:09<06:35, 2.69s/it]
96
+
97
+ 3%|β–Ž | 4/150 [00:11<05:22, 2.21s/it]
98
+
99
+ 3%|β–Ž | 5/150 [00:12<04:42, 1.95s/it]
100
+
101
+
102
+ {'loss': 3.2003, 'grad_norm': 2.315643072128296, 'learning_rate': 2.5e-05, 'epoch': 0.2}
103
+
104
+
105
+ 3%|β–Ž | 5/150 [00:12<04:42, 1.95s/it]
106
+
107
+ 4%|▍ | 6/150 [00:14<04:18, 1.79s/it]
108
+
109
+ 5%|▍ | 7/150 [00:15<04:03, 1.70s/it]
110
+
111
+ 5%|β–Œ | 8/150 [00:17<03:52, 1.64s/it]
112
+
113
+ 6%|β–Œ | 9/150 [00:18<03:44, 1.59s/it]
114
+
115
+ 7%|β–‹ | 10/150 [00:20<03:38, 1.56s/it]
116
+
117
+
118
+ {'loss': 2.9439, 'grad_norm': 1.7202048301696777, 'learning_rate': 4.9647887323943665e-05, 'epoch': 0.4}
119
+
120
+
121
+ 7%|β–‹ | 10/150 [00:20<03:38, 1.56s/it]
122
+
123
+ 7%|β–‹ | 11/150 [00:21<03:34, 1.54s/it]
124
+
125
+ 8%|β–Š | 12/150 [00:23<03:30, 1.52s/it]
126
+
127
+ 9%|β–Š | 13/150 [00:24<03:29, 1.53s/it]
128
+
129
+ 9%|β–‰ | 14/150 [00:26<03:28, 1.53s/it]
130
+
131
+ 10%|β–ˆ | 15/150 [00:27<03:24, 1.51s/it]
132
+
133
+
134
+ {'loss': 2.4466, 'grad_norm': 0.7943887114524841, 'learning_rate': 4.788732394366197e-05, 'epoch': 0.6}
135
+
136
+
137
+ 10%|β–ˆ | 15/150 [00:27<03:24, 1.51s/it]
138
+
139
+ 11%|β–ˆ | 16/150 [00:29<03:23, 1.52s/it]
140
+
141
+ 11%|β–ˆβ– | 17/150 [00:30<03:20, 1.51s/it]
142
+
143
+ 12%|β–ˆβ– | 18/150 [00:32<03:19, 1.51s/it]
144
+
145
+ 13%|β–ˆβ–Ž | 19/150 [00:33<03:15, 1.50s/it]
146
+
147
+ 13%|β–ˆβ–Ž | 20/150 [00:35<03:15, 1.50s/it]
148
+
149
+
150
+ {'loss': 2.163, 'grad_norm': 0.7801544666290283, 'learning_rate': 4.6126760563380286e-05, 'epoch': 0.8}
151
+
152
+
153
+ 13%|β–ˆβ–Ž | 20/150 [00:35<03:15, 1.50s/it]
154
+
155
+ 14%|β–ˆβ– | 21/150 [00:36<03:16, 1.52s/it]
156
+
157
+ 15%|β–ˆβ– | 22/150 [00:38<03:13, 1.51s/it]
158
+
159
+ 15%|β–ˆβ–Œ | 23/150 [00:39<03:12, 1.52s/it]
160
+
161
+ 16%|β–ˆβ–Œ | 24/150 [00:41<03:09, 1.50s/it]
162
+
163
+ 17%|β–ˆβ–‹ | 25/150 [00:42<03:07, 1.50s/it]
164
+
165
+
166
+ {'loss': 1.9002, 'grad_norm': 0.780380129814148, 'learning_rate': 4.436619718309859e-05, 'epoch': 1.0}
167
+
168
+
169
+ 17%|β–ˆβ–‹ | 25/150 [00:42<03:07, 1.50s/it]
170
+
171
+ 17%|β–ˆβ–‹ | 26/150 [00:44<03:07, 1.52s/it]
172
+
173
+ 18%|β–ˆβ–Š | 27/150 [00:45<03:06, 1.51s/it]
174
+
175
+ 19%|β–ˆβ–Š | 28/150 [00:47<03:01, 1.49s/it]
176
+
177
+ 19%|β–ˆβ–‰ | 29/150 [00:48<03:01, 1.50s/it]
178
+
179
+ 20%|β–ˆβ–ˆ | 30/150 [00:50<03:01, 1.51s/it]
180
+
181
+
182
+ {'loss': 1.6575, 'grad_norm': 0.7079782485961914, 'learning_rate': 4.26056338028169e-05, 'epoch': 1.2}
183
+
184
+
185
+ 20%|β–ˆβ–ˆ | 30/150 [00:50<03:01, 1.51s/it]
186
+
187
+ 21%|β–ˆβ–ˆ | 31/150 [00:51<02:56, 1.48s/it]
188
+
189
+ 21%|β–ˆβ–ˆβ– | 32/150 [00:53<02:54, 1.48s/it]
190
+
191
+ 22%|β–ˆβ–ˆβ– | 33/150 [00:54<02:54, 1.50s/it]
192
+
193
+ 23%|β–ˆβ–ˆβ–Ž | 34/150 [00:56<02:54, 1.50s/it]
194
+
195
+ 23%|β–ˆβ–ˆβ–Ž | 35/150 [00:57<02:54, 1.52s/it]
196
+
197
+
198
+ {'loss': 1.4154, 'grad_norm': 0.7800647616386414, 'learning_rate': 4.0845070422535214e-05, 'epoch': 1.4}
199
+
200
+
201
+ 23%|β–ˆβ–ˆβ–Ž | 35/150 [00:57<02:54, 1.52s/it]
202
+
203
+ 24%|β–ˆβ–ˆβ– | 36/150 [00:59<02:54, 1.53s/it]
204
+
205
+ 25%|β–ˆβ–ˆβ– | 37/150 [01:01<02:52, 1.53s/it]
206
+
207
+ 25%|β–ˆβ–ˆβ–Œ | 38/150 [01:02<02:49, 1.51s/it]
208
+
209
+ 26%|β–ˆβ–ˆβ–Œ | 39/150 [01:03<02:47, 1.51s/it]
210
+
211
+ 27%|β–ˆβ–ˆβ–‹ | 40/150 [01:05<02:48, 1.53s/it]
212
+
213
+
214
+ {'loss': 1.1887, 'grad_norm': 0.7795201539993286, 'learning_rate': 3.908450704225352e-05, 'epoch': 1.6}
215
+
216
+
217
+ 27%|β–ˆβ–ˆβ–‹ | 40/150 [01:05<02:48, 1.53s/it]
218
+
219
+ 27%|β–ˆβ–ˆβ–‹ | 41/150 [01:07<02:45, 1.52s/it]
220
+
221
+ 28%|β–ˆβ–ˆβ–Š | 42/150 [01:08<02:43, 1.52s/it]
222
+
223
+ 29%|β–ˆβ–ˆβ–Š | 43/150 [01:10<02:44, 1.54s/it]
224
+
225
+ 29%|β–ˆβ–ˆβ–‰ | 44/150 [01:11<02:40, 1.51s/it]
226
+
227
+ 30%|β–ˆβ–ˆβ–ˆ | 45/150 [01:13<02:39, 1.52s/it]
228
+
229
+
230
+ {'loss': 0.9898, 'grad_norm': 0.5640354752540588, 'learning_rate': 3.7323943661971835e-05, 'epoch': 1.8}
231
+
232
+
233
+ 30%|β–ˆβ–ˆβ–ˆ | 45/150 [01:13<02:39, 1.52s/it]
234
+
235
+ 31%|β–ˆβ–ˆβ–ˆ | 46/150 [01:14<02:38, 1.52s/it]
236
+
237
+ 31%|β–ˆβ–ˆβ–ˆβ– | 47/150 [01:16<02:35, 1.51s/it]
238
+
239
+ 32%|β–ˆβ–ˆβ–ˆβ– | 48/150 [01:17<02:34, 1.52s/it]
240
+
241
+ 33%|β–ˆβ–ˆβ–ˆβ–Ž | 49/150 [01:19<02:33, 1.52s/it]
242
+
243
+ 33%|β–ˆβ–ˆβ–ˆβ–Ž | 50/150 [01:20<02:31, 1.51s/it]
244
+
245
+
246
+ {'loss': 0.9291, 'grad_norm': 0.43140193819999695, 'learning_rate': 3.556338028169014e-05, 'epoch': 2.0}
247
+
248
+
249
+ 33%|β–ˆβ–ˆβ–ˆβ–Ž | 50/150 [01:20<02:31, 1.51s/it]
250
+
251
+ 34%|β–ˆβ–ˆβ–ˆβ– | 51/150 [01:22<02:30, 1.52s/it]
252
+
253
+ 35%|β–ˆβ–ˆβ–ˆβ– | 52/150 [01:23<02:29, 1.53s/it]
254
+
255
+ 35%|β–ˆβ–ˆβ–ˆβ–Œ | 53/150 [01:25<02:27, 1.52s/it]
256
+
257
+ 36%|β–ˆβ–ˆβ–ˆβ–Œ | 54/150 [01:26<02:23, 1.49s/it]
258
+
259
+ 37%|β–ˆβ–ˆβ–ˆβ–‹ | 55/150 [01:28<02:21, 1.49s/it]
260
+
261
+
262
+ {'loss': 0.8827, 'grad_norm': 0.3559304475784302, 'learning_rate': 3.380281690140845e-05, 'epoch': 2.2}
263
+
264
+
265
+ 37%|β–ˆβ–ˆβ–ˆβ–‹ | 55/150 [01:28<02:21, 1.49s/it]
266
+
267
+ 37%|β–ˆβ–ˆβ–ˆβ–‹ | 56/150 [01:29<02:21, 1.51s/it]
268
+
269
+ 38%|β–ˆβ–ˆβ–ˆβ–Š | 57/150 [01:31<02:20, 1.51s/it]
270
+
271
+ 39%|β–ˆβ–ˆβ–ˆβ–Š | 58/150 [01:32<02:19, 1.52s/it]
272
+
273
+ 39%|β–ˆβ–ˆβ–ˆβ–‰ | 59/150 [01:34<02:19, 1.53s/it]
274
+
275
+ 40%|β–ˆβ–ˆβ–ˆβ–ˆ | 60/150 [01:35<02:18, 1.54s/it]
276
+
277
+
278
+ {'loss': 0.8919, 'grad_norm': 0.3519206643104553, 'learning_rate': 3.204225352112676e-05, 'epoch': 2.4}
279
+
280
+
281
+ 40%|β–ˆβ–ˆβ–ˆβ–ˆ | 60/150 [01:35<02:18, 1.54s/it]
282
+
283
+ 41%|β–ˆβ–ˆβ–ˆβ–ˆ | 61/150 [01:37<02:16, 1.53s/it]
284
+
285
+ 41%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 62/150 [01:38<02:14, 1.53s/it]
286
+
287
+ 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 63/150 [01:40<02:12, 1.52s/it]
288
+
289
+ 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 64/150 [01:42<02:11, 1.53s/it]
290
+
291
+ 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 65/150 [01:43<02:09, 1.53s/it]
292
+
293
+
294
+ {'loss': 0.869, 'grad_norm': 0.3210051655769348, 'learning_rate': 3.028169014084507e-05, 'epoch': 2.6}
295
+
296
+
297
+ 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 65/150 [01:43<02:09, 1.53s/it]
298
+
299
+ 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 66/150 [01:45<02:08, 1.53s/it]
300
+
301
+ 45%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 67/150 [01:46<02:05, 1.51s/it]
302
+
303
+ 45%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 68/150 [01:48<02:03, 1.50s/it]
304
+
305
+ 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 69/150 [01:49<02:02, 1.51s/it]
306
+
307
+ 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 70/150 [01:51<02:02, 1.53s/it]
308
+
309
+
310
+ {'loss': 0.8251, 'grad_norm': 0.31960350275039673, 'learning_rate': 2.8521126760563384e-05, 'epoch': 2.8}
311
+
312
+
313
+ 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 70/150 [01:51<02:02, 1.53s/it]
314
+
315
+ 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 71/150 [01:52<02:01, 1.54s/it]
316
+
317
+ 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 72/150 [01:54<01:59, 1.53s/it]
318
+
319
+ 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 73/150 [01:55<01:58, 1.54s/it]
320
+
321
+ 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 74/150 [01:57<01:56, 1.54s/it]
322
+
323
+ 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 75/150 [01:58<01:54, 1.53s/it]
324
+
325
+
326
+ {'loss': 0.8397, 'grad_norm': 0.3365177512168884, 'learning_rate': 2.676056338028169e-05, 'epoch': 3.0}
327
+
328
+
329
+ 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 75/150 [01:58<01:54, 1.53s/it]
330
+
331
+ 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 76/150 [02:00<01:52, 1.52s/it]
332
+
333
+ 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 77/150 [02:01<01:50, 1.52s/it]
334
+
335
+ 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 78/150 [02:03<01:50, 1.53s/it]
336
+
337
+ 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 79/150 [02:04<01:49, 1.54s/it]
338
+
339
+ 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 80/150 [02:06<01:48, 1.54s/it]
340
+
341
+
342
+ {'loss': 0.7999, 'grad_norm': 0.3121998608112335, 'learning_rate': 2.5e-05, 'epoch': 3.2}
343
+
344
+
345
+ 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 80/150 [02:06<01:48, 1.54s/it]
346
+
347
+ 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 81/150 [02:08<01:45, 1.53s/it]
348
+
349
+ 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 82/150 [02:09<01:43, 1.53s/it]
350
+
351
+ 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 83/150 [02:11<01:42, 1.52s/it]
352
+
353
+ 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 84/150 [02:12<01:42, 1.55s/it]
354
+
355
+ 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 85/150 [02:14<01:40, 1.54s/it]
356
+
357
+
358
+ {'loss': 0.8139, 'grad_norm': 0.33012324571609497, 'learning_rate': 2.323943661971831e-05, 'epoch': 3.4}
359
+
360
+
361
+ 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 85/150 [02:14<01:40, 1.54s/it]
362
+
363
+ 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 86/150 [02:15<01:37, 1.53s/it]
364
+
365
+ 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 87/150 [02:17<01:36, 1.54s/it]
366
+
367
+ 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 88/150 [02:18<01:34, 1.53s/it]
368
+
369
+ 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 89/150 [02:20<01:33, 1.53s/it]
370
+
371
+ 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 90/150 [02:21<01:31, 1.52s/it]
372
+
373
+
374
+ {'loss': 0.7965, 'grad_norm': 0.325542151927948, 'learning_rate': 2.147887323943662e-05, 'epoch': 3.6}
375
+
376
+
377
+ 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 90/150 [02:21<01:31, 1.52s/it]
378
+
379
+ 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 91/150 [02:23<01:29, 1.52s/it]
380
+
381
+ 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 92/150 [02:24<01:28, 1.53s/it]
382
+
383
+ 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 93/150 [02:26<01:27, 1.54s/it]
384
+
385
+ 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 94/150 [02:27<01:26, 1.54s/it]
386
+
387
+ 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 95/150 [02:29<01:24, 1.53s/it]
388
+
389
+
390
+ {'loss': 0.8084, 'grad_norm': 0.31672295928001404, 'learning_rate': 1.971830985915493e-05, 'epoch': 3.8}
391
+
392
+
393
+ 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 95/150 [02:29<01:24, 1.53s/it]
394
+
395
+ 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 96/150 [02:31<01:22, 1.54s/it]
396
+
397
+ 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 97/150 [02:32<01:21, 1.54s/it]
398
+
399
+ 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 98/150 [02:33<01:18, 1.51s/it]
400
+
401
+ 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 99/150 [02:35<01:17, 1.52s/it]
402
+
403
+ 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 100/150 [02:37<01:16, 1.53s/it]
404
+
405
+
406
+ {'loss': 0.8154, 'grad_norm': 0.3328615128993988, 'learning_rate': 1.7957746478873243e-05, 'epoch': 4.0}
407
+
408
+
409
+ 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 100/150 [02:37<01:16, 1.53s/it]
410
+
411
+ 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 101/150 [02:38<01:15, 1.54s/it]
412
+
413
+ 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 102/150 [02:40<01:13, 1.54s/it]
414
+
415
+ 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 103/150 [02:41<01:12, 1.54s/it]
416
+
417
+ 69%|β–ˆβ–ˆβ–ˆβ–ˆοΏ½οΏ½οΏ½β–ˆβ–‰ | 104/150 [02:43<01:11, 1.56s/it]
418
+
419
+ 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 105/150 [02:44<01:09, 1.55s/it]
420
+
421
+
422
+ {'loss': 0.7967, 'grad_norm': 0.32036644220352173, 'learning_rate': 1.619718309859155e-05, 'epoch': 4.2}
423
+
424
+
425
+ 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 105/150 [02:44<01:09, 1.55s/it]
426
+
427
+ 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 106/150 [02:46<01:08, 1.55s/it]
428
+
429
+ 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 107/150 [02:47<01:06, 1.55s/it]
430
+
431
+ 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 108/150 [02:49<01:05, 1.56s/it]
432
+
433
+ 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 109/150 [02:51<01:03, 1.55s/it]
434
+
435
+ 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 110/150 [02:52<01:01, 1.53s/it]
436
+
437
+
438
+ {'loss': 0.7825, 'grad_norm': 0.33183553814888, 'learning_rate': 1.443661971830986e-05, 'epoch': 4.4}
439
+
440
+
441
+ 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 110/150 [02:52<01:01, 1.53s/it]
442
+
443
+ 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 111/150 [02:54<00:59, 1.53s/it]
444
+
445
+ 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 112/150 [02:55<00:56, 1.50s/it]
446
+
447
+ 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 113/150 [02:57<00:55, 1.51s/it]
448
+
449
+ 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 114/150 [02:58<00:54, 1.52s/it]
450
+
451
+ 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 115/150 [03:00<00:52, 1.51s/it]
452
+
453
+
454
+ {'loss': 0.7707, 'grad_norm': 0.33796223998069763, 'learning_rate': 1.267605633802817e-05, 'epoch': 4.6}
455
+
456
+
457
+ 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 115/150 [03:00<00:52, 1.51s/it]
458
+
459
+ 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 116/150 [03:01<00:51, 1.51s/it]
460
+
461
+ 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 117/150 [03:03<00:50, 1.52s/it]
462
+
463
+ 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 118/150 [03:04<00:48, 1.52s/it]
464
+
465
+ 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 119/150 [03:06<00:46, 1.49s/it]
466
+
467
+ 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 120/150 [03:07<00:45, 1.51s/it]
468
+
469
+
470
+ {'loss': 0.7708, 'grad_norm': 0.33394232392311096, 'learning_rate': 1.0915492957746478e-05, 'epoch': 4.8}
471
+
472
+
473
+ 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 120/150 [03:07<00:45, 1.51s/it]
474
+
475
+ 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 121/150 [03:09<00:44, 1.53s/it]
476
+
477
+ 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 122/150 [03:10<00:43, 1.54s/it]
478
+
479
+ 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 123/150 [03:12<00:41, 1.53s/it]
480
+
481
+ 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 124/150 [03:13<00:39, 1.50s/it]
482
+
483
+ 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 125/150 [03:15<00:37, 1.51s/it]
484
+
485
+
486
+ {'loss': 0.7812, 'grad_norm': 0.32235094904899597, 'learning_rate': 9.15492957746479e-06, 'epoch': 5.0}
487
+
488
+
489
+ 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 125/150 [03:15<00:37, 1.51s/it]
490
+
491
+ 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 126/150 [03:16<00:36, 1.50s/it]
492
+
493
+ 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 127/150 [03:18<00:34, 1.51s/it]
494
+
495
+ 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 128/150 [03:19<00:32, 1.49s/it]
496
+
497
+ 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 129/150 [03:21<00:31, 1.50s/it]
498
+
499
+ 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 130/150 [03:22<00:29, 1.48s/it]
500
+
501
+
502
+ {'loss': 0.7453, 'grad_norm': 0.3531821668148041, 'learning_rate': 7.394366197183099e-06, 'epoch': 5.2}
503
+
504
+
505
+ 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 130/150 [03:22<00:29, 1.48s/it]
506
+
507
+ 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 131/150 [03:24<00:28, 1.51s/it]
508
+
509
+ 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 132/150 [03:25<00:27, 1.50s/it]
510
+
511
+ 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 133/150 [03:27<00:25, 1.50s/it]
512
+
513
+ 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 134/150 [03:28<00:23, 1.49s/it]
514
+
515
+ 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 135/150 [03:30<00:22, 1.51s/it]
516
+
517
+
518
+ {'loss': 0.7588, 'grad_norm': 0.3414919972419739, 'learning_rate': 5.6338028169014084e-06, 'epoch': 5.4}
519
+
520
+
521
+ 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 135/150 [03:30<00:22, 1.51s/it]
522
+
523
+ 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 136/150 [03:31<00:21, 1.51s/it]
524
+
525
+ 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 137/150 [03:33<00:19, 1.51s/it]
526
+
527
+ 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 138/150 [03:34<00:18, 1.53s/it]
528
+
529
+ 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 139/150 [03:36<00:16, 1.52s/it]
530
+
531
+ 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 140/150 [03:37<00:15, 1.54s/it]
532
+
533
+
534
+ {'loss': 0.7859, 'grad_norm': 0.33363598585128784, 'learning_rate': 3.873239436619718e-06, 'epoch': 5.6}
535
+
536
+
537
+ 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 140/150 [03:37<00:15, 1.54s/it]
538
+
539
+ 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 141/150 [03:39<00:13, 1.53s/it]
540
+
541
+ 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 142/150 [03:40<00:12, 1.52s/it]
542
+
543
+ 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 143/150 [03:42<00:10, 1.52s/it]
544
+
545
+ 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 144/150 [03:43<00:09, 1.53s/it]
546
+
547
+ 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 145/150 [03:45<00:07, 1.52s/it]
548
+
549
+
550
+ {'loss': 0.7676, 'grad_norm': 0.3417748510837555, 'learning_rate': 2.112676056338028e-06, 'epoch': 5.8}
551
+
552
+
553
+ 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 145/150 [03:45<00:07, 1.52s/it]
554
+
555
+ 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 146/150 [03:46<00:06, 1.51s/it]
556
+
557
+ 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 147/150 [03:48<00:04, 1.52s/it]
558
+
559
+ 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 148/150 [03:49<00:02, 1.50s/it]
560
+
561
+ 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 149/150 [03:51<00:01, 1.49s/it]
562
+
563
+ 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 150/150 [03:53<00:00, 1.53s/it]
564
+
565
+
566
+ {'loss': 0.7767, 'grad_norm': 0.3764980137348175, 'learning_rate': 3.5211267605633803e-07, 'epoch': 6.0}
567
+
568
+
569
+ 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 150/150 [03:53<00:00, 1.53s/it]
570
+
571
+
572
+ {'train_runtime': 233.6201, 'train_samples_per_second': 5.137, 'train_steps_per_second': 0.642, 'train_loss': 1.1637831672032675, 'epoch': 6.0}
573
+
574
+
575
+ 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 150/150 [03:53<00:00, 1.53s/it]
576
+ 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 150/150 [03:53<00:00, 1.56s/it]
577
+ SFT done in 3.9 min
578
+
579
+ ══ Pre-GRPO hold-out eval (SFT-only) ══
580
+
581
+ [diagnostic] seed=100 raw completion (first 500 chars):
582
+ </tool_call>
583
+ The news signals a severe supply chain fragmentation and transition shock. 1st-order: Tech faces immediate headwinds from export controls and rare-earth shortages, while OIL benefits from sector rotation. 2nd-order: GREEN faces supply chain constraints, but the transition shock provides a tailwind. 3rd-order: BONDS and REAL_ESTATE are safe havens.
584
+ Given the 12-quarter lockup, I must avoid OIL due to the 25 kg carbon cap. A 0.25 allocation to OIL would emit 2.5 * 0.25 * 12 = 7.5 kg,
585
+ [parse_action result]: metadata={} weights=[0.35, 0.0, 0.45, 0.15, 0.05] infra_commit=0.15 carbon_offset_buy=0.0 put_hedge=0.0 tech_bet='fragmentation'
586
+
587
+ ── Hold-out eval (5/5 valid) ──
588
+ mean regret: +0.0340
589
+ beat baseline: 3/5
590
+ Found HuggingFace hub cache directory: /tmp/CarbonAlpha/hf_cache/hub
591
+ Checking cache directory for required files...
592
+
593
+
594
+ Unsloth: Copying 2 files from cache to `/tmp/CarbonAlpha/checkpoints/final_merged`: 0%| | 0/2 [00:00<?, ?it/s]
595
+
596
+ Unsloth: Copying 2 files from cache to `/tmp/CarbonAlpha/checkpoints/final_merged`: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2/2 [00:01<00:00, 1.40it/s]
597
+ Unsloth: Copying 2 files from cache to `/tmp/CarbonAlpha/checkpoints/final_merged`: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2/2 [00:01<00:00, 1.40it/s]
598
+ Successfully copied all 2 files from cache to `/tmp/CarbonAlpha/checkpoints/final_merged`
599
+ Checking cache directory for required files...
600
+ Cache check failed: tokenizer.model not found in local cache.
601
+ Not all required files found in cache. Will proceed with downloading.
602
+
603
+
604
+ Unsloth: Preparing safetensor model files: 0%| | 0/2 [00:00<?, ?it/s]
605
+ Unsloth: Preparing safetensor model files: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2/2 [00:00<00:00, 57065.36it/s]
606
+
607
+
608
+ Unsloth: Merging weights into 16bit: 0%| | 0/2 [00:00<?, ?it/s]
609
+
610
+ Unsloth: Merging weights into 16bit: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 1/2 [00:30<00:30, 30.52s/it]
611
+
612
+ Unsloth: Merging weights into 16bit: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2/2 [00:47<00:00, 22.45s/it]
613
+ Unsloth: Merging weights into 16bit: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2/2 [00:47<00:00, 23.66s/it]
614
+ Unsloth: Merge process complete. Saved to `/tmp/CarbonAlpha/checkpoints/final_merged`
615
+ SFT-only mode. Saved LoRA adapters to /tmp/CarbonAlpha/checkpoints/final_merged