Instructions to use melihcatal/codedp-cpt-models-v2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use melihcatal/codedp-cpt-models-v2 with PEFT:
Task type is invalid.
- Notebooks
- Google Colab
- Kaggle
File size: 19,501 Bytes
076fd74 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 | 2026-03-26 21:48:00,537 [INFO] new_opacus_codex.train_steps: epoch=1 step=5 loss=2.0965
2026-03-26 21:48:26,199 [INFO] new_opacus_codex.train_steps: epoch=1 step=10 loss=2.0740
2026-03-26 21:48:40,447 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=10 eval_loss=0.8438 duration_sec=14.23
2026-03-26 21:49:05,242 [INFO] new_opacus_codex.train_steps: epoch=1 step=15 loss=2.0506
2026-03-26 21:49:31,282 [INFO] new_opacus_codex.train_steps: epoch=1 step=20 loss=2.0738
2026-03-26 21:49:45,556 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=20 eval_loss=0.8438 duration_sec=14.26
2026-03-26 21:50:09,610 [INFO] new_opacus_codex.train_steps: epoch=1 step=25 loss=2.2433
2026-03-26 21:50:35,960 [INFO] new_opacus_codex.train_steps: epoch=1 step=30 loss=2.1463
2026-03-26 21:50:50,275 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=30 eval_loss=0.8438 duration_sec=14.30
2026-03-26 21:51:14,645 [INFO] new_opacus_codex.train_steps: epoch=1 step=35 loss=1.8533
2026-03-26 21:51:40,961 [INFO] new_opacus_codex.train_steps: epoch=1 step=40 loss=2.0047
2026-03-26 21:51:55,282 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=40 eval_loss=0.8439 duration_sec=14.31
2026-03-26 21:52:19,559 [INFO] new_opacus_codex.train_steps: epoch=1 step=45 loss=2.2556
2026-03-26 21:52:46,158 [INFO] new_opacus_codex.train_steps: epoch=1 step=50 loss=2.0732
2026-03-26 21:53:00,487 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=50 eval_loss=0.8442 duration_sec=14.32
2026-03-26 21:53:25,407 [INFO] new_opacus_codex.train_steps: epoch=1 step=55 loss=1.8839
2026-03-26 21:53:50,653 [INFO] new_opacus_codex.train_steps: epoch=1 step=60 loss=2.0296
2026-03-26 21:54:04,964 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=60 eval_loss=0.8444 duration_sec=14.30
2026-03-26 21:54:29,810 [INFO] new_opacus_codex.train_steps: epoch=1 step=65 loss=2.2169
2026-03-26 21:54:55,208 [INFO] new_opacus_codex.train_steps: epoch=1 step=70 loss=2.2310
2026-03-26 21:55:09,527 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=70 eval_loss=0.8447 duration_sec=14.31
2026-03-26 21:55:35,398 [INFO] new_opacus_codex.train_steps: epoch=1 step=75 loss=2.3162
2026-03-26 21:56:01,133 [INFO] new_opacus_codex.train_steps: epoch=1 step=80 loss=2.2552
2026-03-26 21:56:15,596 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=80 eval_loss=0.8449 duration_sec=14.45
2026-03-26 21:56:41,833 [INFO] new_opacus_codex.train_steps: epoch=1 step=85 loss=1.9948
2026-03-26 21:57:06,859 [INFO] new_opacus_codex.train_steps: epoch=1 step=90 loss=2.0550
2026-03-26 21:57:21,166 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=90 eval_loss=0.8452 duration_sec=14.29
2026-03-26 21:57:47,112 [INFO] new_opacus_codex.train_steps: epoch=1 step=95 loss=2.1835
2026-03-26 21:58:11,827 [INFO] new_opacus_codex.train_steps: epoch=1 step=100 loss=2.2523
2026-03-26 21:58:26,216 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=100 eval_loss=0.8454 duration_sec=14.37
2026-03-26 21:58:52,983 [INFO] new_opacus_codex.train_steps: epoch=1 step=105 loss=2.3648
2026-03-26 21:59:18,099 [INFO] new_opacus_codex.train_steps: epoch=1 step=110 loss=2.2856
2026-03-26 21:59:32,400 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=110 eval_loss=0.8456 duration_sec=14.29
2026-03-26 21:59:58,833 [INFO] new_opacus_codex.train_steps: epoch=1 step=115 loss=2.0431
2026-03-26 22:00:23,998 [INFO] new_opacus_codex.train_steps: epoch=1 step=120 loss=2.0698
2026-03-26 22:00:38,689 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=120 eval_loss=0.8458 duration_sec=14.67
2026-03-26 22:01:04,075 [INFO] new_opacus_codex.train_steps: epoch=1 step=125 loss=2.2100
2026-03-26 22:01:29,671 [INFO] new_opacus_codex.train_steps: epoch=1 step=130 loss=2.0161
2026-03-26 22:01:43,967 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=130 eval_loss=0.8460 duration_sec=14.28
2026-03-26 22:02:08,901 [INFO] new_opacus_codex.train_steps: epoch=1 step=135 loss=2.0243
2026-03-26 22:02:35,010 [INFO] new_opacus_codex.train_steps: epoch=1 step=140 loss=2.2631
2026-03-26 22:02:49,318 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=140 eval_loss=0.8463 duration_sec=14.30
2026-03-26 22:03:14,157 [INFO] new_opacus_codex.train_steps: epoch=1 step=145 loss=2.1024
2026-03-26 22:03:40,409 [INFO] new_opacus_codex.train_steps: epoch=1 step=150 loss=2.0706
2026-03-26 22:03:54,760 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=150 eval_loss=0.8465 duration_sec=14.34
2026-03-26 22:04:19,526 [INFO] new_opacus_codex.train_steps: epoch=1 step=155 loss=1.9638
2026-03-26 22:04:46,782 [INFO] new_opacus_codex.train_steps: epoch=1 step=160 loss=1.8354
2026-03-26 22:05:01,123 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=160 eval_loss=0.8467 duration_sec=14.33
2026-03-26 22:05:25,948 [INFO] new_opacus_codex.train_steps: epoch=1 step=165 loss=2.1498
2026-03-26 22:05:52,494 [INFO] new_opacus_codex.train_steps: epoch=1 step=170 loss=2.1747
2026-03-26 22:06:06,791 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=170 eval_loss=0.8468 duration_sec=14.29
2026-03-26 22:06:31,988 [INFO] new_opacus_codex.train_steps: epoch=1 step=175 loss=1.9538
2026-03-26 22:06:58,174 [INFO] new_opacus_codex.train_steps: epoch=1 step=180 loss=2.1538
2026-03-26 22:07:12,518 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=180 eval_loss=0.8469 duration_sec=14.33
2026-03-26 22:07:38,447 [INFO] new_opacus_codex.train_steps: epoch=1 step=185 loss=2.2217
2026-03-26 22:08:03,246 [INFO] new_opacus_codex.train_steps: epoch=1 step=190 loss=2.1630
2026-03-26 22:08:17,670 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=190 eval_loss=0.8470 duration_sec=14.41
2026-03-26 22:08:43,461 [INFO] new_opacus_codex.train_steps: epoch=1 step=195 loss=2.2596
2026-03-26 22:09:08,714 [INFO] new_opacus_codex.train_steps: epoch=1 step=200 loss=2.0670
2026-03-26 22:09:22,975 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=200 eval_loss=0.8472 duration_sec=14.25
2026-03-26 22:09:49,270 [INFO] new_opacus_codex.train_steps: epoch=1 step=205 loss=1.9729
2026-03-26 22:10:36,026 [INFO] new_opacus_codex.train_steps: epoch=2 step=210 loss=1.6895
2026-03-26 22:10:50,297 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=210 eval_loss=0.8473 duration_sec=14.27
2026-03-26 22:11:18,438 [INFO] new_opacus_codex.train_steps: epoch=2 step=215 loss=2.0352
2026-03-26 22:11:43,395 [INFO] new_opacus_codex.train_steps: epoch=2 step=220 loss=2.1403
2026-03-26 22:11:57,748 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=220 eval_loss=0.8475 duration_sec=14.33
2026-03-26 22:12:25,218 [INFO] new_opacus_codex.train_steps: epoch=2 step=225 loss=1.9779
2026-03-26 22:12:50,777 [INFO] new_opacus_codex.train_steps: epoch=2 step=230 loss=1.9893
2026-03-26 22:13:05,129 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=230 eval_loss=0.8476 duration_sec=14.34
2026-03-26 22:13:31,262 [INFO] new_opacus_codex.train_steps: epoch=2 step=235 loss=2.1829
2026-03-26 22:13:56,472 [INFO] new_opacus_codex.train_steps: epoch=2 step=240 loss=2.2177
2026-03-26 22:14:10,802 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=240 eval_loss=0.8478 duration_sec=14.32
2026-03-26 22:14:36,990 [INFO] new_opacus_codex.train_steps: epoch=2 step=245 loss=2.0736
2026-03-26 22:15:03,086 [INFO] new_opacus_codex.train_steps: epoch=2 step=250 loss=2.0885
2026-03-26 22:15:17,401 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=250 eval_loss=0.8478 duration_sec=14.31
2026-03-26 22:15:42,970 [INFO] new_opacus_codex.train_steps: epoch=2 step=255 loss=2.0921
2026-03-26 22:16:09,177 [INFO] new_opacus_codex.train_steps: epoch=2 step=260 loss=1.9281
2026-03-26 22:16:23,522 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=260 eval_loss=0.8479 duration_sec=14.34
2026-03-26 22:16:47,939 [INFO] new_opacus_codex.train_steps: epoch=2 step=265 loss=1.9739
2026-03-26 22:17:14,402 [INFO] new_opacus_codex.train_steps: epoch=2 step=270 loss=2.0684
2026-03-26 22:17:28,772 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=270 eval_loss=0.8479 duration_sec=14.35
2026-03-26 22:17:53,651 [INFO] new_opacus_codex.train_steps: epoch=2 step=275 loss=2.0468
2026-03-26 22:18:20,927 [INFO] new_opacus_codex.train_steps: epoch=2 step=280 loss=2.0730
2026-03-26 22:18:35,289 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=280 eval_loss=0.8480 duration_sec=14.36
2026-03-26 22:18:59,920 [INFO] new_opacus_codex.train_steps: epoch=2 step=285 loss=2.1239
2026-03-26 22:19:26,647 [INFO] new_opacus_codex.train_steps: epoch=2 step=290 loss=2.1021
2026-03-26 22:19:41,002 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=290 eval_loss=0.8481 duration_sec=14.34
2026-03-26 22:20:05,716 [INFO] new_opacus_codex.train_steps: epoch=2 step=295 loss=2.0917
2026-03-26 22:20:32,362 [INFO] new_opacus_codex.train_steps: epoch=2 step=300 loss=2.1965
2026-03-26 22:20:46,671 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=300 eval_loss=0.8482 duration_sec=14.29
2026-03-26 22:21:12,696 [INFO] new_opacus_codex.train_steps: epoch=2 step=305 loss=2.2463
2026-03-26 22:21:38,998 [INFO] new_opacus_codex.train_steps: epoch=2 step=310 loss=2.2180
2026-03-26 22:21:53,277 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=310 eval_loss=0.8483 duration_sec=14.19
2026-03-26 22:22:18,995 [INFO] new_opacus_codex.train_steps: epoch=2 step=315 loss=2.1983
2026-03-26 22:22:45,042 [INFO] new_opacus_codex.train_steps: epoch=2 step=320 loss=2.2371
2026-03-26 22:22:59,353 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=320 eval_loss=0.8483 duration_sec=14.18
2026-03-26 22:23:24,821 [INFO] new_opacus_codex.train_steps: epoch=2 step=325 loss=2.2170
2026-03-26 22:23:50,396 [INFO] new_opacus_codex.train_steps: epoch=2 step=330 loss=2.1873
2026-03-26 22:24:04,699 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=330 eval_loss=0.8483 duration_sec=14.27
2026-03-26 22:24:30,379 [INFO] new_opacus_codex.train_steps: epoch=2 step=335 loss=2.0300
2026-03-26 22:24:55,644 [INFO] new_opacus_codex.train_steps: epoch=2 step=340 loss=1.8838
2026-03-26 22:25:10,057 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=340 eval_loss=0.8483 duration_sec=14.33
2026-03-26 22:25:37,334 [INFO] new_opacus_codex.train_steps: epoch=2 step=345 loss=1.8080
2026-03-26 22:26:02,703 [INFO] new_opacus_codex.train_steps: epoch=2 step=350 loss=1.8755
2026-03-26 22:26:17,134 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=350 eval_loss=0.8484 duration_sec=14.29
2026-03-26 22:26:42,829 [INFO] new_opacus_codex.train_steps: epoch=2 step=355 loss=2.2443
2026-03-26 22:27:09,316 [INFO] new_opacus_codex.train_steps: epoch=2 step=360 loss=2.2905
2026-03-26 22:27:23,679 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=360 eval_loss=0.8484 duration_sec=14.23
2026-03-26 22:27:49,780 [INFO] new_opacus_codex.train_steps: epoch=2 step=365 loss=2.0968
2026-03-26 22:28:16,933 [INFO] new_opacus_codex.train_steps: epoch=2 step=370 loss=2.0605
2026-03-26 22:28:31,266 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=370 eval_loss=0.8484 duration_sec=14.22
2026-03-26 22:28:57,578 [INFO] new_opacus_codex.train_steps: epoch=2 step=375 loss=1.9692
2026-03-26 22:29:23,374 [INFO] new_opacus_codex.train_steps: epoch=2 step=380 loss=1.9666
2026-03-26 22:29:37,658 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=380 eval_loss=0.8484 duration_sec=14.27
2026-03-26 22:30:03,251 [INFO] new_opacus_codex.train_steps: epoch=2 step=385 loss=2.1136
2026-03-26 22:30:29,099 [INFO] new_opacus_codex.train_steps: epoch=2 step=390 loss=2.0575
2026-03-26 22:30:43,419 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=390 eval_loss=0.8485 duration_sec=14.31
2026-03-26 22:31:08,447 [INFO] new_opacus_codex.train_steps: epoch=2 step=395 loss=1.9648
2026-03-26 22:31:34,603 [INFO] new_opacus_codex.train_steps: epoch=2 step=400 loss=1.9254
2026-03-26 22:31:48,992 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=400 eval_loss=0.8485 duration_sec=14.38
2026-03-26 22:32:14,838 [INFO] new_opacus_codex.train_steps: epoch=2 step=405 loss=1.9092
2026-03-26 22:32:41,251 [INFO] new_opacus_codex.train_steps: epoch=2 step=410 loss=2.0961
2026-03-26 22:32:55,643 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=410 eval_loss=0.8485 duration_sec=14.38
2026-03-26 22:33:21,206 [INFO] new_opacus_codex.train_steps: epoch=2 step=415 loss=2.0573
2026-03-26 22:34:10,008 [INFO] new_opacus_codex.train_steps: epoch=3 step=420 loss=2.1745
2026-03-26 22:34:24,248 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=420 eval_loss=0.8485 duration_sec=14.23
2026-03-26 22:34:50,540 [INFO] new_opacus_codex.train_steps: epoch=3 step=425 loss=1.9973
2026-03-26 22:35:16,937 [INFO] new_opacus_codex.train_steps: epoch=3 step=430 loss=1.9931
2026-03-26 22:35:31,303 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=430 eval_loss=0.8485 duration_sec=14.34
2026-03-26 22:35:56,917 [INFO] new_opacus_codex.train_steps: epoch=3 step=435 loss=1.9746
2026-03-26 22:36:23,153 [INFO] new_opacus_codex.train_steps: epoch=3 step=440 loss=1.7725
2026-03-26 22:36:37,474 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=440 eval_loss=0.8485 duration_sec=14.31
2026-03-26 22:37:02,886 [INFO] new_opacus_codex.train_steps: epoch=3 step=445 loss=1.8957
2026-03-26 22:37:29,749 [INFO] new_opacus_codex.train_steps: epoch=3 step=450 loss=2.1199
2026-03-26 22:37:43,999 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=450 eval_loss=0.8485 duration_sec=14.24
2026-03-26 22:38:09,178 [INFO] new_opacus_codex.train_steps: epoch=3 step=455 loss=2.2018
2026-03-26 22:38:36,024 [INFO] new_opacus_codex.train_steps: epoch=3 step=460 loss=2.1082
2026-03-26 22:38:50,333 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=460 eval_loss=0.8485 duration_sec=14.29
2026-03-26 22:39:15,251 [INFO] new_opacus_codex.train_steps: epoch=3 step=465 loss=2.0519
2026-03-26 22:39:41,508 [INFO] new_opacus_codex.train_steps: epoch=3 step=470 loss=2.1447
2026-03-26 22:39:55,793 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=470 eval_loss=0.8485 duration_sec=14.27
2026-03-26 22:40:21,601 [INFO] new_opacus_codex.train_steps: epoch=3 step=475 loss=2.1674
2026-03-26 22:40:46,802 [INFO] new_opacus_codex.train_steps: epoch=3 step=480 loss=2.1513
2026-03-26 22:41:01,163 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=480 eval_loss=0.8486 duration_sec=14.35
2026-03-26 22:41:28,432 [INFO] new_opacus_codex.train_steps: epoch=3 step=485 loss=2.0010
2026-03-26 22:41:54,238 [INFO] new_opacus_codex.train_steps: epoch=3 step=490 loss=2.0321
2026-03-26 22:42:08,574 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=490 eval_loss=0.8485 duration_sec=14.32
2026-03-26 22:42:34,565 [INFO] new_opacus_codex.train_steps: epoch=3 step=495 loss=2.2499
2026-03-26 22:43:00,606 [INFO] new_opacus_codex.train_steps: epoch=3 step=500 loss=2.2952
2026-03-26 22:43:14,890 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=500 eval_loss=0.8485 duration_sec=14.27
2026-03-26 22:43:41,340 [INFO] new_opacus_codex.train_steps: epoch=3 step=505 loss=2.1940
2026-03-26 22:44:06,975 [INFO] new_opacus_codex.train_steps: epoch=3 step=510 loss=2.1671
2026-03-26 22:44:21,393 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=510 eval_loss=0.8485 duration_sec=14.39
2026-03-26 22:44:47,294 [INFO] new_opacus_codex.train_steps: epoch=3 step=515 loss=2.0725
2026-03-26 22:45:13,949 [INFO] new_opacus_codex.train_steps: epoch=3 step=520 loss=2.0577
2026-03-26 22:45:28,311 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=520 eval_loss=0.8486 duration_sec=14.35
2026-03-26 22:45:54,578 [INFO] new_opacus_codex.train_steps: epoch=3 step=525 loss=1.9688
2026-03-26 22:46:20,229 [INFO] new_opacus_codex.train_steps: epoch=3 step=530 loss=1.9090
2026-03-26 22:46:34,527 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=530 eval_loss=0.8485 duration_sec=14.29
2026-03-26 22:47:01,018 [INFO] new_opacus_codex.train_steps: epoch=3 step=535 loss=2.2408
2026-03-26 22:47:26,827 [INFO] new_opacus_codex.train_steps: epoch=3 step=540 loss=2.2717
2026-03-26 22:47:41,170 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=540 eval_loss=0.8485 duration_sec=14.33
2026-03-26 22:48:07,065 [INFO] new_opacus_codex.train_steps: epoch=3 step=545 loss=2.2006
2026-03-26 22:48:32,648 [INFO] new_opacus_codex.train_steps: epoch=3 step=550 loss=2.1780
2026-03-26 22:48:46,944 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=550 eval_loss=0.8485 duration_sec=14.28
2026-03-26 22:49:12,291 [INFO] new_opacus_codex.train_steps: epoch=3 step=555 loss=2.0125
2026-03-26 22:49:38,496 [INFO] new_opacus_codex.train_steps: epoch=3 step=560 loss=1.9844
2026-03-26 22:49:52,778 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=560 eval_loss=0.8485 duration_sec=14.27
2026-03-26 22:50:17,954 [INFO] new_opacus_codex.train_steps: epoch=3 step=565 loss=2.0798
2026-03-26 22:50:44,409 [INFO] new_opacus_codex.train_steps: epoch=3 step=570 loss=2.1226
2026-03-26 22:50:58,787 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=570 eval_loss=0.8485 duration_sec=14.37
2026-03-26 22:51:25,250 [INFO] new_opacus_codex.train_steps: epoch=3 step=575 loss=2.0833
2026-03-26 22:51:52,045 [INFO] new_opacus_codex.train_steps: epoch=3 step=580 loss=2.0278
2026-03-26 22:52:06,458 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=580 eval_loss=0.8486 duration_sec=14.40
2026-03-26 22:52:33,050 [INFO] new_opacus_codex.train_steps: epoch=3 step=585 loss=2.1341
2026-03-26 22:52:58,968 [INFO] new_opacus_codex.train_steps: epoch=3 step=590 loss=2.2731
2026-03-26 22:53:13,300 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=590 eval_loss=0.8485 duration_sec=14.32
2026-03-26 22:53:39,211 [INFO] new_opacus_codex.train_steps: epoch=3 step=595 loss=2.1922
2026-03-26 22:54:05,226 [INFO] new_opacus_codex.train_steps: epoch=3 step=600 loss=1.9313
2026-03-26 22:54:20,522 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=600 eval_loss=0.8486 duration_sec=15.29
2026-03-26 22:54:45,456 [INFO] new_opacus_codex.train_steps: epoch=3 step=605 loss=2.0718
2026-03-26 22:55:11,906 [INFO] new_opacus_codex.train_steps: epoch=3 step=610 loss=2.2035
2026-03-26 22:55:26,232 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=610 eval_loss=0.8485 duration_sec=14.31
2026-03-26 22:55:52,483 [INFO] new_opacus_codex.train_steps: epoch=3 step=615 loss=2.1000
2026-03-26 22:56:18,107 [INFO] new_opacus_codex.train_steps: epoch=3 step=620 loss=2.1004
2026-03-26 22:56:32,988 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=620 eval_loss=0.8485 duration_sec=14.86
|