Instructions to use melihcatal/codedp-cpt-models-v2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use melihcatal/codedp-cpt-models-v2 with PEFT:
Task type is invalid.
- Notebooks
- Google Colab
- Kaggle
| 2026-03-26 21:48:00,537 [INFO] new_opacus_codex.train_steps: epoch=1 step=5 loss=2.0965 | |
| 2026-03-26 21:48:26,199 [INFO] new_opacus_codex.train_steps: epoch=1 step=10 loss=2.0740 | |
| 2026-03-26 21:48:40,447 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=10 eval_loss=0.8438 duration_sec=14.23 | |
| 2026-03-26 21:49:05,242 [INFO] new_opacus_codex.train_steps: epoch=1 step=15 loss=2.0506 | |
| 2026-03-26 21:49:31,282 [INFO] new_opacus_codex.train_steps: epoch=1 step=20 loss=2.0738 | |
| 2026-03-26 21:49:45,556 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=20 eval_loss=0.8438 duration_sec=14.26 | |
| 2026-03-26 21:50:09,610 [INFO] new_opacus_codex.train_steps: epoch=1 step=25 loss=2.2433 | |
| 2026-03-26 21:50:35,960 [INFO] new_opacus_codex.train_steps: epoch=1 step=30 loss=2.1463 | |
| 2026-03-26 21:50:50,275 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=30 eval_loss=0.8438 duration_sec=14.30 | |
| 2026-03-26 21:51:14,645 [INFO] new_opacus_codex.train_steps: epoch=1 step=35 loss=1.8533 | |
| 2026-03-26 21:51:40,961 [INFO] new_opacus_codex.train_steps: epoch=1 step=40 loss=2.0047 | |
| 2026-03-26 21:51:55,282 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=40 eval_loss=0.8439 duration_sec=14.31 | |
| 2026-03-26 21:52:19,559 [INFO] new_opacus_codex.train_steps: epoch=1 step=45 loss=2.2556 | |
| 2026-03-26 21:52:46,158 [INFO] new_opacus_codex.train_steps: epoch=1 step=50 loss=2.0732 | |
| 2026-03-26 21:53:00,487 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=50 eval_loss=0.8442 duration_sec=14.32 | |
| 2026-03-26 21:53:25,407 [INFO] new_opacus_codex.train_steps: epoch=1 step=55 loss=1.8839 | |
| 2026-03-26 21:53:50,653 [INFO] new_opacus_codex.train_steps: epoch=1 step=60 loss=2.0296 | |
| 2026-03-26 21:54:04,964 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=60 eval_loss=0.8444 duration_sec=14.30 | |
| 2026-03-26 21:54:29,810 [INFO] new_opacus_codex.train_steps: epoch=1 step=65 loss=2.2169 | |
| 2026-03-26 21:54:55,208 [INFO] new_opacus_codex.train_steps: epoch=1 step=70 loss=2.2310 | |
| 2026-03-26 21:55:09,527 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=70 eval_loss=0.8447 duration_sec=14.31 | |
| 2026-03-26 21:55:35,398 [INFO] new_opacus_codex.train_steps: epoch=1 step=75 loss=2.3162 | |
| 2026-03-26 21:56:01,133 [INFO] new_opacus_codex.train_steps: epoch=1 step=80 loss=2.2552 | |
| 2026-03-26 21:56:15,596 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=80 eval_loss=0.8449 duration_sec=14.45 | |
| 2026-03-26 21:56:41,833 [INFO] new_opacus_codex.train_steps: epoch=1 step=85 loss=1.9948 | |
| 2026-03-26 21:57:06,859 [INFO] new_opacus_codex.train_steps: epoch=1 step=90 loss=2.0550 | |
| 2026-03-26 21:57:21,166 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=90 eval_loss=0.8452 duration_sec=14.29 | |
| 2026-03-26 21:57:47,112 [INFO] new_opacus_codex.train_steps: epoch=1 step=95 loss=2.1835 | |
| 2026-03-26 21:58:11,827 [INFO] new_opacus_codex.train_steps: epoch=1 step=100 loss=2.2523 | |
| 2026-03-26 21:58:26,216 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=100 eval_loss=0.8454 duration_sec=14.37 | |
| 2026-03-26 21:58:52,983 [INFO] new_opacus_codex.train_steps: epoch=1 step=105 loss=2.3648 | |
| 2026-03-26 21:59:18,099 [INFO] new_opacus_codex.train_steps: epoch=1 step=110 loss=2.2856 | |
| 2026-03-26 21:59:32,400 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=110 eval_loss=0.8456 duration_sec=14.29 | |
| 2026-03-26 21:59:58,833 [INFO] new_opacus_codex.train_steps: epoch=1 step=115 loss=2.0431 | |
| 2026-03-26 22:00:23,998 [INFO] new_opacus_codex.train_steps: epoch=1 step=120 loss=2.0698 | |
| 2026-03-26 22:00:38,689 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=120 eval_loss=0.8458 duration_sec=14.67 | |
| 2026-03-26 22:01:04,075 [INFO] new_opacus_codex.train_steps: epoch=1 step=125 loss=2.2100 | |
| 2026-03-26 22:01:29,671 [INFO] new_opacus_codex.train_steps: epoch=1 step=130 loss=2.0161 | |
| 2026-03-26 22:01:43,967 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=130 eval_loss=0.8460 duration_sec=14.28 | |
| 2026-03-26 22:02:08,901 [INFO] new_opacus_codex.train_steps: epoch=1 step=135 loss=2.0243 | |
| 2026-03-26 22:02:35,010 [INFO] new_opacus_codex.train_steps: epoch=1 step=140 loss=2.2631 | |
| 2026-03-26 22:02:49,318 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=140 eval_loss=0.8463 duration_sec=14.30 | |
| 2026-03-26 22:03:14,157 [INFO] new_opacus_codex.train_steps: epoch=1 step=145 loss=2.1024 | |
| 2026-03-26 22:03:40,409 [INFO] new_opacus_codex.train_steps: epoch=1 step=150 loss=2.0706 | |
| 2026-03-26 22:03:54,760 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=150 eval_loss=0.8465 duration_sec=14.34 | |
| 2026-03-26 22:04:19,526 [INFO] new_opacus_codex.train_steps: epoch=1 step=155 loss=1.9638 | |
| 2026-03-26 22:04:46,782 [INFO] new_opacus_codex.train_steps: epoch=1 step=160 loss=1.8354 | |
| 2026-03-26 22:05:01,123 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=160 eval_loss=0.8467 duration_sec=14.33 | |
| 2026-03-26 22:05:25,948 [INFO] new_opacus_codex.train_steps: epoch=1 step=165 loss=2.1498 | |
| 2026-03-26 22:05:52,494 [INFO] new_opacus_codex.train_steps: epoch=1 step=170 loss=2.1747 | |
| 2026-03-26 22:06:06,791 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=170 eval_loss=0.8468 duration_sec=14.29 | |
| 2026-03-26 22:06:31,988 [INFO] new_opacus_codex.train_steps: epoch=1 step=175 loss=1.9538 | |
| 2026-03-26 22:06:58,174 [INFO] new_opacus_codex.train_steps: epoch=1 step=180 loss=2.1538 | |
| 2026-03-26 22:07:12,518 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=180 eval_loss=0.8469 duration_sec=14.33 | |
| 2026-03-26 22:07:38,447 [INFO] new_opacus_codex.train_steps: epoch=1 step=185 loss=2.2217 | |
| 2026-03-26 22:08:03,246 [INFO] new_opacus_codex.train_steps: epoch=1 step=190 loss=2.1630 | |
| 2026-03-26 22:08:17,670 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=190 eval_loss=0.8470 duration_sec=14.41 | |
| 2026-03-26 22:08:43,461 [INFO] new_opacus_codex.train_steps: epoch=1 step=195 loss=2.2596 | |
| 2026-03-26 22:09:08,714 [INFO] new_opacus_codex.train_steps: epoch=1 step=200 loss=2.0670 | |
| 2026-03-26 22:09:22,975 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=1 step=200 eval_loss=0.8472 duration_sec=14.25 | |
| 2026-03-26 22:09:49,270 [INFO] new_opacus_codex.train_steps: epoch=1 step=205 loss=1.9729 | |
| 2026-03-26 22:10:36,026 [INFO] new_opacus_codex.train_steps: epoch=2 step=210 loss=1.6895 | |
| 2026-03-26 22:10:50,297 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=210 eval_loss=0.8473 duration_sec=14.27 | |
| 2026-03-26 22:11:18,438 [INFO] new_opacus_codex.train_steps: epoch=2 step=215 loss=2.0352 | |
| 2026-03-26 22:11:43,395 [INFO] new_opacus_codex.train_steps: epoch=2 step=220 loss=2.1403 | |
| 2026-03-26 22:11:57,748 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=220 eval_loss=0.8475 duration_sec=14.33 | |
| 2026-03-26 22:12:25,218 [INFO] new_opacus_codex.train_steps: epoch=2 step=225 loss=1.9779 | |
| 2026-03-26 22:12:50,777 [INFO] new_opacus_codex.train_steps: epoch=2 step=230 loss=1.9893 | |
| 2026-03-26 22:13:05,129 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=230 eval_loss=0.8476 duration_sec=14.34 | |
| 2026-03-26 22:13:31,262 [INFO] new_opacus_codex.train_steps: epoch=2 step=235 loss=2.1829 | |
| 2026-03-26 22:13:56,472 [INFO] new_opacus_codex.train_steps: epoch=2 step=240 loss=2.2177 | |
| 2026-03-26 22:14:10,802 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=240 eval_loss=0.8478 duration_sec=14.32 | |
| 2026-03-26 22:14:36,990 [INFO] new_opacus_codex.train_steps: epoch=2 step=245 loss=2.0736 | |
| 2026-03-26 22:15:03,086 [INFO] new_opacus_codex.train_steps: epoch=2 step=250 loss=2.0885 | |
| 2026-03-26 22:15:17,401 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=250 eval_loss=0.8478 duration_sec=14.31 | |
| 2026-03-26 22:15:42,970 [INFO] new_opacus_codex.train_steps: epoch=2 step=255 loss=2.0921 | |
| 2026-03-26 22:16:09,177 [INFO] new_opacus_codex.train_steps: epoch=2 step=260 loss=1.9281 | |
| 2026-03-26 22:16:23,522 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=260 eval_loss=0.8479 duration_sec=14.34 | |
| 2026-03-26 22:16:47,939 [INFO] new_opacus_codex.train_steps: epoch=2 step=265 loss=1.9739 | |
| 2026-03-26 22:17:14,402 [INFO] new_opacus_codex.train_steps: epoch=2 step=270 loss=2.0684 | |
| 2026-03-26 22:17:28,772 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=270 eval_loss=0.8479 duration_sec=14.35 | |
| 2026-03-26 22:17:53,651 [INFO] new_opacus_codex.train_steps: epoch=2 step=275 loss=2.0468 | |
| 2026-03-26 22:18:20,927 [INFO] new_opacus_codex.train_steps: epoch=2 step=280 loss=2.0730 | |
| 2026-03-26 22:18:35,289 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=280 eval_loss=0.8480 duration_sec=14.36 | |
| 2026-03-26 22:18:59,920 [INFO] new_opacus_codex.train_steps: epoch=2 step=285 loss=2.1239 | |
| 2026-03-26 22:19:26,647 [INFO] new_opacus_codex.train_steps: epoch=2 step=290 loss=2.1021 | |
| 2026-03-26 22:19:41,002 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=290 eval_loss=0.8481 duration_sec=14.34 | |
| 2026-03-26 22:20:05,716 [INFO] new_opacus_codex.train_steps: epoch=2 step=295 loss=2.0917 | |
| 2026-03-26 22:20:32,362 [INFO] new_opacus_codex.train_steps: epoch=2 step=300 loss=2.1965 | |
| 2026-03-26 22:20:46,671 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=300 eval_loss=0.8482 duration_sec=14.29 | |
| 2026-03-26 22:21:12,696 [INFO] new_opacus_codex.train_steps: epoch=2 step=305 loss=2.2463 | |
| 2026-03-26 22:21:38,998 [INFO] new_opacus_codex.train_steps: epoch=2 step=310 loss=2.2180 | |
| 2026-03-26 22:21:53,277 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=310 eval_loss=0.8483 duration_sec=14.19 | |
| 2026-03-26 22:22:18,995 [INFO] new_opacus_codex.train_steps: epoch=2 step=315 loss=2.1983 | |
| 2026-03-26 22:22:45,042 [INFO] new_opacus_codex.train_steps: epoch=2 step=320 loss=2.2371 | |
| 2026-03-26 22:22:59,353 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=320 eval_loss=0.8483 duration_sec=14.18 | |
| 2026-03-26 22:23:24,821 [INFO] new_opacus_codex.train_steps: epoch=2 step=325 loss=2.2170 | |
| 2026-03-26 22:23:50,396 [INFO] new_opacus_codex.train_steps: epoch=2 step=330 loss=2.1873 | |
| 2026-03-26 22:24:04,699 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=330 eval_loss=0.8483 duration_sec=14.27 | |
| 2026-03-26 22:24:30,379 [INFO] new_opacus_codex.train_steps: epoch=2 step=335 loss=2.0300 | |
| 2026-03-26 22:24:55,644 [INFO] new_opacus_codex.train_steps: epoch=2 step=340 loss=1.8838 | |
| 2026-03-26 22:25:10,057 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=340 eval_loss=0.8483 duration_sec=14.33 | |
| 2026-03-26 22:25:37,334 [INFO] new_opacus_codex.train_steps: epoch=2 step=345 loss=1.8080 | |
| 2026-03-26 22:26:02,703 [INFO] new_opacus_codex.train_steps: epoch=2 step=350 loss=1.8755 | |
| 2026-03-26 22:26:17,134 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=350 eval_loss=0.8484 duration_sec=14.29 | |
| 2026-03-26 22:26:42,829 [INFO] new_opacus_codex.train_steps: epoch=2 step=355 loss=2.2443 | |
| 2026-03-26 22:27:09,316 [INFO] new_opacus_codex.train_steps: epoch=2 step=360 loss=2.2905 | |
| 2026-03-26 22:27:23,679 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=360 eval_loss=0.8484 duration_sec=14.23 | |
| 2026-03-26 22:27:49,780 [INFO] new_opacus_codex.train_steps: epoch=2 step=365 loss=2.0968 | |
| 2026-03-26 22:28:16,933 [INFO] new_opacus_codex.train_steps: epoch=2 step=370 loss=2.0605 | |
| 2026-03-26 22:28:31,266 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=370 eval_loss=0.8484 duration_sec=14.22 | |
| 2026-03-26 22:28:57,578 [INFO] new_opacus_codex.train_steps: epoch=2 step=375 loss=1.9692 | |
| 2026-03-26 22:29:23,374 [INFO] new_opacus_codex.train_steps: epoch=2 step=380 loss=1.9666 | |
| 2026-03-26 22:29:37,658 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=380 eval_loss=0.8484 duration_sec=14.27 | |
| 2026-03-26 22:30:03,251 [INFO] new_opacus_codex.train_steps: epoch=2 step=385 loss=2.1136 | |
| 2026-03-26 22:30:29,099 [INFO] new_opacus_codex.train_steps: epoch=2 step=390 loss=2.0575 | |
| 2026-03-26 22:30:43,419 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=390 eval_loss=0.8485 duration_sec=14.31 | |
| 2026-03-26 22:31:08,447 [INFO] new_opacus_codex.train_steps: epoch=2 step=395 loss=1.9648 | |
| 2026-03-26 22:31:34,603 [INFO] new_opacus_codex.train_steps: epoch=2 step=400 loss=1.9254 | |
| 2026-03-26 22:31:48,992 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=400 eval_loss=0.8485 duration_sec=14.38 | |
| 2026-03-26 22:32:14,838 [INFO] new_opacus_codex.train_steps: epoch=2 step=405 loss=1.9092 | |
| 2026-03-26 22:32:41,251 [INFO] new_opacus_codex.train_steps: epoch=2 step=410 loss=2.0961 | |
| 2026-03-26 22:32:55,643 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=2 step=410 eval_loss=0.8485 duration_sec=14.38 | |
| 2026-03-26 22:33:21,206 [INFO] new_opacus_codex.train_steps: epoch=2 step=415 loss=2.0573 | |
| 2026-03-26 22:34:10,008 [INFO] new_opacus_codex.train_steps: epoch=3 step=420 loss=2.1745 | |
| 2026-03-26 22:34:24,248 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=420 eval_loss=0.8485 duration_sec=14.23 | |
| 2026-03-26 22:34:50,540 [INFO] new_opacus_codex.train_steps: epoch=3 step=425 loss=1.9973 | |
| 2026-03-26 22:35:16,937 [INFO] new_opacus_codex.train_steps: epoch=3 step=430 loss=1.9931 | |
| 2026-03-26 22:35:31,303 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=430 eval_loss=0.8485 duration_sec=14.34 | |
| 2026-03-26 22:35:56,917 [INFO] new_opacus_codex.train_steps: epoch=3 step=435 loss=1.9746 | |
| 2026-03-26 22:36:23,153 [INFO] new_opacus_codex.train_steps: epoch=3 step=440 loss=1.7725 | |
| 2026-03-26 22:36:37,474 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=440 eval_loss=0.8485 duration_sec=14.31 | |
| 2026-03-26 22:37:02,886 [INFO] new_opacus_codex.train_steps: epoch=3 step=445 loss=1.8957 | |
| 2026-03-26 22:37:29,749 [INFO] new_opacus_codex.train_steps: epoch=3 step=450 loss=2.1199 | |
| 2026-03-26 22:37:43,999 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=450 eval_loss=0.8485 duration_sec=14.24 | |
| 2026-03-26 22:38:09,178 [INFO] new_opacus_codex.train_steps: epoch=3 step=455 loss=2.2018 | |
| 2026-03-26 22:38:36,024 [INFO] new_opacus_codex.train_steps: epoch=3 step=460 loss=2.1082 | |
| 2026-03-26 22:38:50,333 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=460 eval_loss=0.8485 duration_sec=14.29 | |
| 2026-03-26 22:39:15,251 [INFO] new_opacus_codex.train_steps: epoch=3 step=465 loss=2.0519 | |
| 2026-03-26 22:39:41,508 [INFO] new_opacus_codex.train_steps: epoch=3 step=470 loss=2.1447 | |
| 2026-03-26 22:39:55,793 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=470 eval_loss=0.8485 duration_sec=14.27 | |
| 2026-03-26 22:40:21,601 [INFO] new_opacus_codex.train_steps: epoch=3 step=475 loss=2.1674 | |
| 2026-03-26 22:40:46,802 [INFO] new_opacus_codex.train_steps: epoch=3 step=480 loss=2.1513 | |
| 2026-03-26 22:41:01,163 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=480 eval_loss=0.8486 duration_sec=14.35 | |
| 2026-03-26 22:41:28,432 [INFO] new_opacus_codex.train_steps: epoch=3 step=485 loss=2.0010 | |
| 2026-03-26 22:41:54,238 [INFO] new_opacus_codex.train_steps: epoch=3 step=490 loss=2.0321 | |
| 2026-03-26 22:42:08,574 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=490 eval_loss=0.8485 duration_sec=14.32 | |
| 2026-03-26 22:42:34,565 [INFO] new_opacus_codex.train_steps: epoch=3 step=495 loss=2.2499 | |
| 2026-03-26 22:43:00,606 [INFO] new_opacus_codex.train_steps: epoch=3 step=500 loss=2.2952 | |
| 2026-03-26 22:43:14,890 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=500 eval_loss=0.8485 duration_sec=14.27 | |
| 2026-03-26 22:43:41,340 [INFO] new_opacus_codex.train_steps: epoch=3 step=505 loss=2.1940 | |
| 2026-03-26 22:44:06,975 [INFO] new_opacus_codex.train_steps: epoch=3 step=510 loss=2.1671 | |
| 2026-03-26 22:44:21,393 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=510 eval_loss=0.8485 duration_sec=14.39 | |
| 2026-03-26 22:44:47,294 [INFO] new_opacus_codex.train_steps: epoch=3 step=515 loss=2.0725 | |
| 2026-03-26 22:45:13,949 [INFO] new_opacus_codex.train_steps: epoch=3 step=520 loss=2.0577 | |
| 2026-03-26 22:45:28,311 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=520 eval_loss=0.8486 duration_sec=14.35 | |
| 2026-03-26 22:45:54,578 [INFO] new_opacus_codex.train_steps: epoch=3 step=525 loss=1.9688 | |
| 2026-03-26 22:46:20,229 [INFO] new_opacus_codex.train_steps: epoch=3 step=530 loss=1.9090 | |
| 2026-03-26 22:46:34,527 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=530 eval_loss=0.8485 duration_sec=14.29 | |
| 2026-03-26 22:47:01,018 [INFO] new_opacus_codex.train_steps: epoch=3 step=535 loss=2.2408 | |
| 2026-03-26 22:47:26,827 [INFO] new_opacus_codex.train_steps: epoch=3 step=540 loss=2.2717 | |
| 2026-03-26 22:47:41,170 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=540 eval_loss=0.8485 duration_sec=14.33 | |
| 2026-03-26 22:48:07,065 [INFO] new_opacus_codex.train_steps: epoch=3 step=545 loss=2.2006 | |
| 2026-03-26 22:48:32,648 [INFO] new_opacus_codex.train_steps: epoch=3 step=550 loss=2.1780 | |
| 2026-03-26 22:48:46,944 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=550 eval_loss=0.8485 duration_sec=14.28 | |
| 2026-03-26 22:49:12,291 [INFO] new_opacus_codex.train_steps: epoch=3 step=555 loss=2.0125 | |
| 2026-03-26 22:49:38,496 [INFO] new_opacus_codex.train_steps: epoch=3 step=560 loss=1.9844 | |
| 2026-03-26 22:49:52,778 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=560 eval_loss=0.8485 duration_sec=14.27 | |
| 2026-03-26 22:50:17,954 [INFO] new_opacus_codex.train_steps: epoch=3 step=565 loss=2.0798 | |
| 2026-03-26 22:50:44,409 [INFO] new_opacus_codex.train_steps: epoch=3 step=570 loss=2.1226 | |
| 2026-03-26 22:50:58,787 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=570 eval_loss=0.8485 duration_sec=14.37 | |
| 2026-03-26 22:51:25,250 [INFO] new_opacus_codex.train_steps: epoch=3 step=575 loss=2.0833 | |
| 2026-03-26 22:51:52,045 [INFO] new_opacus_codex.train_steps: epoch=3 step=580 loss=2.0278 | |
| 2026-03-26 22:52:06,458 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=580 eval_loss=0.8486 duration_sec=14.40 | |
| 2026-03-26 22:52:33,050 [INFO] new_opacus_codex.train_steps: epoch=3 step=585 loss=2.1341 | |
| 2026-03-26 22:52:58,968 [INFO] new_opacus_codex.train_steps: epoch=3 step=590 loss=2.2731 | |
| 2026-03-26 22:53:13,300 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=590 eval_loss=0.8485 duration_sec=14.32 | |
| 2026-03-26 22:53:39,211 [INFO] new_opacus_codex.train_steps: epoch=3 step=595 loss=2.1922 | |
| 2026-03-26 22:54:05,226 [INFO] new_opacus_codex.train_steps: epoch=3 step=600 loss=1.9313 | |
| 2026-03-26 22:54:20,522 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=600 eval_loss=0.8486 duration_sec=15.29 | |
| 2026-03-26 22:54:45,456 [INFO] new_opacus_codex.train_steps: epoch=3 step=605 loss=2.0718 | |
| 2026-03-26 22:55:11,906 [INFO] new_opacus_codex.train_steps: epoch=3 step=610 loss=2.2035 | |
| 2026-03-26 22:55:26,232 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=610 eval_loss=0.8485 duration_sec=14.31 | |
| 2026-03-26 22:55:52,483 [INFO] new_opacus_codex.train_steps: epoch=3 step=615 loss=2.1000 | |
| 2026-03-26 22:56:18,107 [INFO] new_opacus_codex.train_steps: epoch=3 step=620 loss=2.1004 | |
| 2026-03-26 22:56:32,988 [INFO] new_opacus_codex.train_steps: eval event=eval_step epoch=3 step=620 eval_loss=0.8485 duration_sec=14.86 | |