File size: 32,795 Bytes
f3ccc64 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 | 05/12/2026 04:15:00 - INFO - accelerate.utils.modeling - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
05/12/2026 04:15:34 - INFO - root - number of sec samples before upsampling: 1806
05/12/2026 04:15:34 - INFO - root - number of sec samples after upsampling: 2394
05/12/2026 04:15:37 - INFO - root - Training args Namespace(output_name='codellama-7b-lora-safecoder', datasets=['evol', 'sec-desc', 'sec-new-desc'], pretrain_name='codellama-7b', loss_weight=1.0, sven=False, num_train_epochs=5, learning_rate=0.001, max_num_tokens=1024, batch_size=1, grad_acc_steps=16, weight_decay=0.01, adam_epsilon=1e-08, warmup_steps=0, max_grad_norm=1.0, dropout=0.1, kl_loss_weight=0, exclude_neg=False, no_weights=False, lora=True, r=16, lora_alpha=32, lora_dropout=0.1, sampling_size=20, sampling_method='minority', cwes=['all'], langs=['all'], logging_steps=50, save_epochs=10, seed=2, data_dir='../data_train_val', model_dir='../trained/', output_dir='../trained/codellama-7b-lora-safecoder', logger=<RootLogger root (INFO)>)
05/12/2026 04:15:37 - INFO - root - ***** Running training *****
05/12/2026 04:15:37 - INFO - root - Num samples = 30692
05/12/2026 04:15:37 - INFO - root - Num epoch = 5
05/12/2026 04:15:37 - INFO - root - Batch size= 1
05/12/2026 04:15:37 - INFO - root - Total batch size (w. accumulation) = 16
05/12/2026 04:15:37 - INFO - root - Gradient Accumulation steps = 16
05/12/2026 04:15:37 - INFO - root - Total optimization steps = 9590
05/12/2026 04:15:37 - INFO - root - Num val samples = 3339
05/12/2026 04:15:37 - INFO - root - Num parameters = 6779101440
05/12/2026 04:15:37 - INFO - root - Num trainable parameters = 40554752
05/12/2026 04:21:02 - INFO - root - epochs: 1/5, steps: 50/9590, func: 0.050948, pos: 0.066863, neg: 0.102675, 0%: 17h 13m 50s
05/12/2026 04:26:32 - INFO - root - epochs: 1/5, steps: 100/9590, func: 0.047577, pos: 0.074183, neg: 0.041983, 1%: 17h 15m 36s
05/12/2026 04:31:56 - INFO - root - epochs: 1/5, steps: 150/9590, func: 0.048479, pos: 0.083375, neg: 0.044575, 1%: 17h 7m 9s
05/12/2026 04:37:28 - INFO - root - epochs: 1/5, steps: 200/9590, func: 0.049801, pos: 0.100972, neg: 0.029368, 2%: 17h 5m 57s
05/12/2026 04:42:57 - INFO - root - epochs: 1/5, steps: 250/9590, func: 0.048792, pos: 0.109971, neg: 0.047552, 2%: 17h 1m 3s
05/12/2026 04:48:33 - INFO - root - epochs: 1/5, steps: 300/9590, func: 0.050624, pos: 0.096543, neg: 0.028878, 3%: 17h 0m 26s
05/12/2026 04:53:57 - INFO - root - epochs: 1/5, steps: 350/9590, func: 0.049892, pos: 0.131824, neg: 0.033636, 3%: 16h 52m 16s
05/12/2026 04:59:21 - INFO - root - epochs: 1/5, steps: 400/9590, func: 0.051175, pos: 0.098686, neg: 0.026805, 4%: 16h 45m 38s
05/12/2026 05:04:53 - INFO - root - epochs: 1/5, steps: 450/9590, func: 0.050767, pos: 0.132716, neg: 0.029258, 4%: 16h 40m 58s
05/12/2026 05:10:22 - INFO - root - epochs: 1/5, steps: 500/9590, func: 0.051894, pos: 0.139853, neg: 0.040111, 5%: 16h 35m 38s
05/12/2026 05:15:46 - INFO - root - epochs: 1/5, steps: 550/9590, func: 0.050903, pos: 0.099576, neg: 0.036543, 5%: 16h 28m 50s
05/12/2026 05:21:17 - INFO - root - epochs: 1/5, steps: 600/9590, func: 0.050571, pos: 0.105049, neg: 0.035279, 6%: 16h 24m 7s
05/12/2026 05:26:49 - INFO - root - epochs: 1/5, steps: 650/9590, func: 0.051513, pos: 0.08535, neg: 0.026287, 6%: 16h 19m 19s
05/12/2026 05:32:16 - INFO - root - epochs: 1/5, steps: 700/9590, func: 0.051382, pos: 0.11401, neg: 0.022054, 7%: 16h 13m 37s
05/12/2026 05:37:42 - INFO - root - epochs: 1/5, steps: 750/9590, func: 0.052051, pos: 0.110181, neg: 0.022295, 7%: 16h 7m 50s
05/12/2026 05:41:40 - INFO - accelerate.utils.modeling - We will use 90% of the memory on device 0 for storing the model, and 10% for the buffer to avoid OOM. You can set `max_memory` in to a higher value to use more memory (at your own risk).
05/12/2026 05:42:09 - INFO - root - number of sec samples before upsampling: 1806
05/12/2026 05:42:09 - INFO - root - number of sec samples after upsampling: 2394
05/12/2026 05:42:12 - INFO - root - Training args Namespace(output_name='codellama-7b-lora-safecoder', datasets=['evol', 'sec-desc', 'sec-new-desc'], pretrain_name='codellama-7b', loss_weight=1.0, sven=False, num_train_epochs=5, learning_rate=0.001, max_num_tokens=1024, batch_size=1, grad_acc_steps=16, weight_decay=0.01, adam_epsilon=1e-08, warmup_steps=0, max_grad_norm=1.0, dropout=0.1, kl_loss_weight=0, exclude_neg=False, no_weights=False, lora=True, r=16, lora_alpha=32, lora_dropout=0.1, sampling_size=20, sampling_method='minority', cwes=['all'], langs=['all'], logging_steps=50, save_epochs=10, seed=2, data_dir='../data_train_val', model_dir='../trained/', output_dir='../trained/codellama-7b-lora-safecoder', logger=<RootLogger root (INFO)>)
05/12/2026 05:42:12 - INFO - root - ***** Running training *****
05/12/2026 05:42:12 - INFO - root - Num samples = 30692
05/12/2026 05:42:12 - INFO - root - Num epoch = 5
05/12/2026 05:42:12 - INFO - root - Batch size= 1
05/12/2026 05:42:12 - INFO - root - Total batch size (w. accumulation) = 16
05/12/2026 05:42:12 - INFO - root - Gradient Accumulation steps = 16
05/12/2026 05:42:12 - INFO - root - Total optimization steps = 9590
05/12/2026 05:42:12 - INFO - root - Num val samples = 3339
05/12/2026 05:42:12 - INFO - root - Num parameters = 6779101440
05/12/2026 05:42:12 - INFO - root - Num trainable parameters = 40554752
05/12/2026 05:46:16 - INFO - root - epochs: 1/5, steps: 50/9590, func: 0.05094, pos: 0.066244, neg: 0.104063, 0%: 12h 56m 45s
05/12/2026 05:50:23 - INFO - root - epochs: 1/5, steps: 100/9590, func: 0.049307, pos: 0.077082, neg: 0.04401, 1%: 12h 55m 55s
05/12/2026 05:54:25 - INFO - root - epochs: 1/5, steps: 150/9590, func: 0.051967, pos: 0.102716, neg: 0.041288, 1%: 12h 48m 36s
05/12/2026 05:58:31 - INFO - root - epochs: 1/5, steps: 200/9590, func: 0.04968, pos: 0.089183, neg: 0.03749, 2%: 12h 45m 58s
05/12/2026 06:02:37 - INFO - root - epochs: 1/5, steps: 250/9590, func: 0.048648, pos: 0.089534, neg: 0.056231, 2%: 12h 42m 50s
05/12/2026 06:06:49 - INFO - root - epochs: 1/5, steps: 300/9590, func: 0.050689, pos: 0.087186, neg: 0.035415, 3%: 12h 42m 28s
05/12/2026 06:10:50 - INFO - root - epochs: 1/5, steps: 350/9590, func: 0.050049, pos: 0.11521, neg: 0.036819, 3%: 12h 36m 12s
05/12/2026 06:14:51 - INFO - root - epochs: 1/5, steps: 400/9590, func: 0.051514, pos: 0.094837, neg: 0.029605, 4%: 12h 30m 41s
05/12/2026 06:18:58 - INFO - root - epochs: 1/5, steps: 450/9590, func: 0.050486, pos: 0.142797, neg: 0.023274, 4%: 12h 27m 11s
05/12/2026 06:23:02 - INFO - root - epochs: 1/5, steps: 500/9590, func: 0.052427, pos: 0.131603, neg: 0.040929, 5%: 12h 22m 24s
05/12/2026 06:27:00 - INFO - root - epochs: 1/5, steps: 550/9590, func: 0.051054, pos: 0.111602, neg: 0.036933, 5%: 12h 16m 23s
05/12/2026 06:31:03 - INFO - root - epochs: 1/5, steps: 600/9590, func: 0.050244, pos: 0.093686, neg: 0.032851, 6%: 12h 11m 56s
05/12/2026 06:35:05 - INFO - root - epochs: 1/5, steps: 650/9590, func: 0.051575, pos: 0.093165, neg: 0.028331, 6%: 12h 7m 26s
05/12/2026 06:39:06 - INFO - root - epochs: 1/5, steps: 700/9590, func: 0.051052, pos: 0.127034, neg: 0.017673, 7%: 12h 2m 36s
05/12/2026 06:43:09 - INFO - root - epochs: 1/5, steps: 750/9590, func: 0.051672, pos: 0.106718, neg: 0.021583, 7%: 11h 58m 27s
05/12/2026 06:47:11 - INFO - root - epochs: 1/5, steps: 800/9590, func: 0.052223, pos: 0.199383, neg: 0.018326, 8%: 11h 54m 17s
05/12/2026 06:51:14 - INFO - root - epochs: 1/5, steps: 850/9590, func: 0.051818, pos: 0.131503, neg: 0.017953, 8%: 11h 49m 55s
05/12/2026 06:55:19 - INFO - root - epochs: 1/5, steps: 900/9590, func: 0.051805, pos: 0.095854, neg: 0.031777, 9%: 11h 46m 2s
05/12/2026 06:59:20 - INFO - root - epochs: 1/5, steps: 950/9590, func: 0.051391, pos: 0.114702, neg: 0.024798, 9%: 11h 41m 38s
05/12/2026 07:03:21 - INFO - root - epochs: 1/5, steps: 1000/9590, func: 0.052208, pos: 0.105047, neg: 0.025137, 10%: 11h 37m 13s
05/12/2026 07:07:24 - INFO - root - epochs: 1/5, steps: 1050/9590, func: 0.052544, pos: 0.109283, neg: 0.023481, 10%: 11h 33m 8s
05/12/2026 07:11:27 - INFO - root - epochs: 1/5, steps: 1100/9590, func: 0.052499, pos: 0.074911, neg: 0.017531, 11%: 11h 28m 54s
05/12/2026 07:15:30 - INFO - root - epochs: 1/5, steps: 1150/9590, func: 0.052749, pos: 0.074735, neg: 0.034612, 11%: 11h 24m 47s
05/12/2026 07:19:29 - INFO - root - epochs: 1/5, steps: 1200/9590, func: 0.05158, pos: 0.08104, neg: 0.030226, 12%: 11h 20m 21s
05/12/2026 07:23:31 - INFO - root - epochs: 1/5, steps: 1250/9590, func: 0.05273, pos: 0.112143, neg: 0.023065, 13%: 11h 16m 6s
05/12/2026 07:27:33 - INFO - root - epochs: 1/5, steps: 1300/9590, func: 0.051978, pos: 0.0801, neg: 0.018668, 13%: 11h 11m 56s
05/12/2026 07:31:36 - INFO - root - epochs: 1/5, steps: 1350/9590, func: 0.053459, pos: 0.109475, neg: 0.021353, 14%: 11h 7m 53s
05/12/2026 07:35:37 - INFO - root - epochs: 1/5, steps: 1400/9590, func: 0.051982, pos: 0.099747, neg: 0.017942, 14%: 11h 3m 32s
05/12/2026 07:39:45 - INFO - root - epochs: 1/5, steps: 1450/9590, func: 0.052103, pos: 0.0787, neg: 0.018889, 15%: 10h 59m 57s
05/12/2026 07:43:53 - INFO - root - epochs: 1/5, steps: 1500/9590, func: 0.053136, pos: 0.086926, neg: 0.022841, 15%: 10h 56m 18s
05/12/2026 07:47:58 - INFO - root - epochs: 1/5, steps: 1550/9590, func: 0.051407, pos: 0.074462, neg: 0.033253, 16%: 10h 52m 29s
05/12/2026 07:52:02 - INFO - root - epochs: 1/5, steps: 1600/9590, func: 0.051884, pos: 0.113117, neg: 0.013997, 16%: 10h 48m 21s
05/12/2026 07:56:05 - INFO - root - epochs: 1/5, steps: 1650/9590, func: 0.051529, pos: 0.096809, neg: 0.034268, 17%: 10h 44m 22s
05/12/2026 08:00:07 - INFO - root - epochs: 1/5, steps: 1700/9590, func: 0.053094, pos: 0.094909, neg: 0.025515, 17%: 10h 40m 11s
05/12/2026 08:04:13 - INFO - root - epochs: 1/5, steps: 1750/9590, func: 0.051572, pos: 0.070656, neg: 0.018058, 18%: 10h 36m 16s
05/12/2026 08:08:14 - INFO - root - epochs: 1/5, steps: 1800/9590, func: 0.053385, pos: 0.081258, neg: 0.021151, 18%: 10h 32m 3s
05/12/2026 08:12:15 - INFO - root - epochs: 1/5, steps: 1850/9590, func: 0.051872, pos: 0.082147, neg: 0.025378, 19%: 10h 27m 52s
05/12/2026 08:16:14 - INFO - root - epochs: 1/5, steps: 1900/9590, func: 0.051771, pos: 0.069222, neg: 0.02802, 19%: 10h 23m 33s
05/12/2026 08:20:22 - INFO - root - epochs: 2/5, steps: 1950/9590, func: 0.049655, pos: 0.091716, neg: 0.017551, 20%: 10h 19m 51s
05/12/2026 08:24:27 - INFO - root - epochs: 2/5, steps: 2000/9590, func: 0.048194, pos: 0.089666, neg: 0.016199, 20%: 10h 15m 51s
05/12/2026 08:28:31 - INFO - root - epochs: 2/5, steps: 2050/9590, func: 0.047949, pos: 0.064842, neg: 0.016331, 21%: 10h 11m 50s
05/12/2026 08:32:25 - INFO - root - epochs: 2/5, steps: 2100/9590, func: 0.047398, pos: 0.05705, neg: 0.022757, 21%: 10h 7m 14s
05/12/2026 08:36:23 - INFO - root - epochs: 2/5, steps: 2150/9590, func: 0.046701, pos: 0.047835, neg: 0.007168, 22%: 10h 2m 49s
05/12/2026 08:40:20 - INFO - root - epochs: 2/5, steps: 2200/9590, func: 0.047965, pos: 0.075317, neg: 0.016316, 22%: 9h 58m 28s
05/12/2026 08:44:15 - INFO - root - epochs: 2/5, steps: 2250/9590, func: 0.049061, pos: 0.063372, neg: 0.013699, 23%: 9h 53m 59s
05/12/2026 08:48:25 - INFO - root - epochs: 2/5, steps: 2300/9590, func: 0.049057, pos: 0.048502, neg: 0.018374, 23%: 9h 50m 15s
05/12/2026 08:52:32 - INFO - root - epochs: 2/5, steps: 2350/9590, func: 0.048354, pos: 0.069101, neg: 0.026868, 24%: 9h 46m 26s
05/12/2026 08:56:38 - INFO - root - epochs: 2/5, steps: 2400/9590, func: 0.049917, pos: 0.054647, neg: 0.015619, 25%: 9h 42m 34s
05/12/2026 09:00:41 - INFO - root - epochs: 2/5, steps: 2450/9590, func: 0.049093, pos: 0.066494, neg: 0.018207, 25%: 9h 38m 31s
05/12/2026 09:04:46 - INFO - root - epochs: 2/5, steps: 2500/9590, func: 0.048869, pos: 0.050998, neg: 0.01707, 26%: 9h 34m 33s
05/12/2026 09:08:49 - INFO - root - epochs: 2/5, steps: 2550/9590, func: 0.049238, pos: 0.049718, neg: 0.021854, 26%: 9h 30m 32s
05/12/2026 09:12:54 - INFO - root - epochs: 2/5, steps: 2600/9590, func: 0.048473, pos: 0.048989, neg: 0.010893, 27%: 9h 26m 33s
05/12/2026 09:16:56 - INFO - root - epochs: 2/5, steps: 2650/9590, func: 0.047705, pos: 0.057008, neg: 0.013472, 27%: 9h 22m 26s
05/12/2026 09:20:51 - INFO - root - epochs: 2/5, steps: 2700/9590, func: 0.048697, pos: 0.051007, neg: 0.020675, 28%: 9h 18m 3s
05/12/2026 09:24:59 - INFO - root - epochs: 2/5, steps: 2750/9590, func: 0.049822, pos: 0.066192, neg: 0.019821, 28%: 9h 14m 14s
05/12/2026 09:29:03 - INFO - root - epochs: 2/5, steps: 2800/9590, func: 0.048675, pos: 0.053753, neg: 0.013854, 29%: 9h 10m 12s
05/12/2026 09:33:07 - INFO - root - epochs: 2/5, steps: 2850/9590, func: 0.049093, pos: 0.069586, neg: 0.016566, 29%: 9h 6m 10s
05/12/2026 09:37:04 - INFO - root - epochs: 2/5, steps: 2900/9590, func: 0.048524, pos: 0.070572, neg: 0.017197, 30%: 9h 1m 56s
05/12/2026 09:41:08 - INFO - root - epochs: 2/5, steps: 2950/9590, func: 0.049634, pos: 0.074833, neg: 0.014874, 30%: 8h 57m 53s
05/12/2026 09:45:11 - INFO - root - epochs: 2/5, steps: 3000/9590, func: 0.050054, pos: 0.087303, neg: 0.010392, 31%: 8h 53m 50s
05/12/2026 09:49:13 - INFO - root - epochs: 2/5, steps: 3050/9590, func: 0.050311, pos: 0.062351, neg: 0.018711, 31%: 8h 49m 46s
05/12/2026 09:53:16 - INFO - root - epochs: 2/5, steps: 3100/9590, func: 0.04941, pos: 0.055028, neg: 0.014512, 32%: 8h 45m 44s
05/12/2026 09:57:21 - INFO - root - epochs: 2/5, steps: 3150/9590, func: 0.048447, pos: 0.049718, neg: 0.010809, 32%: 8h 41m 45s
05/12/2026 10:01:26 - INFO - root - epochs: 2/5, steps: 3200/9590, func: 0.048828, pos: 0.04025, neg: 0.011925, 33%: 8h 37m 45s
05/12/2026 10:05:28 - INFO - root - epochs: 2/5, steps: 3250/9590, func: 0.049547, pos: 0.047331, neg: 0.016769, 33%: 8h 33m 40s
05/12/2026 10:09:32 - INFO - root - epochs: 2/5, steps: 3300/9590, func: 0.049657, pos: 0.052612, neg: 0.01622, 34%: 8h 29m 38s
05/12/2026 10:13:36 - INFO - root - epochs: 2/5, steps: 3350/9590, func: 0.048103, pos: 0.051177, neg: 0.016804, 34%: 8h 25m 37s
05/12/2026 10:17:43 - INFO - root - epochs: 2/5, steps: 3400/9590, func: 0.049741, pos: 0.046914, neg: 0.00997, 35%: 8h 21m 39s
05/12/2026 10:21:39 - INFO - root - epochs: 2/5, steps: 3450/9590, func: 0.049321, pos: 0.044237, neg: 0.021636, 35%: 8h 17m 24s
05/12/2026 10:25:41 - INFO - root - epochs: 2/5, steps: 3500/9590, func: 0.048902, pos: 0.054882, neg: 0.014377, 36%: 8h 13m 20s
05/12/2026 10:29:46 - INFO - root - epochs: 2/5, steps: 3550/9590, func: 0.048824, pos: 0.063807, neg: 0.014463, 37%: 8h 9m 20s
05/12/2026 10:33:45 - INFO - root - epochs: 2/5, steps: 3600/9590, func: 0.048231, pos: 0.058499, neg: 0.024831, 37%: 8h 5m 11s
05/12/2026 10:37:44 - INFO - root - epochs: 2/5, steps: 3650/9590, func: 0.048081, pos: 0.085183, neg: 0.014266, 38%: 8h 1m 1s
05/12/2026 10:41:45 - INFO - root - epochs: 2/5, steps: 3700/9590, func: 0.049161, pos: 0.052952, neg: 0.017932, 38%: 7h 56m 57s
05/12/2026 10:45:46 - INFO - root - epochs: 2/5, steps: 3750/9590, func: 0.0463, pos: 0.038567, neg: 0.024301, 39%: 7h 52m 49s
05/12/2026 10:49:48 - INFO - root - epochs: 2/5, steps: 3800/9590, func: 0.049297, pos: 0.069484, neg: 0.014716, 39%: 7h 48m 46s
05/12/2026 10:53:53 - INFO - root - epochs: 3/5, steps: 3850/9590, func: 0.046816, pos: 0.040204, neg: 0.01251, 40%: 7h 44m 46s
05/12/2026 10:58:02 - INFO - root - epochs: 3/5, steps: 3900/9590, func: 0.043129, pos: 0.026737, neg: 0.005862, 40%: 7h 40m 51s
05/12/2026 11:02:08 - INFO - root - epochs: 3/5, steps: 3950/9590, func: 0.042884, pos: 0.033689, neg: 0.018398, 41%: 7h 36m 55s
05/12/2026 11:06:16 - INFO - root - epochs: 3/5, steps: 4000/9590, func: 0.04335, pos: 0.033484, neg: 0.011237, 41%: 7h 32m 57s
05/12/2026 11:10:22 - INFO - root - epochs: 3/5, steps: 4050/9590, func: 0.043222, pos: 0.03106, neg: 0.009611, 42%: 7h 28m 58s
05/12/2026 11:14:26 - INFO - root - epochs: 3/5, steps: 4100/9590, func: 0.04263, pos: 0.024914, neg: 0.0074, 42%: 7h 24m 56s
05/12/2026 11:18:25 - INFO - root - epochs: 3/5, steps: 4150/9590, func: 0.042534, pos: 0.029838, neg: 0.00935, 43%: 7h 20m 48s
05/12/2026 11:22:26 - INFO - root - epochs: 3/5, steps: 4200/9590, func: 0.043006, pos: 0.023594, neg: 0.003426, 43%: 7h 16m 44s
05/12/2026 11:26:27 - INFO - root - epochs: 3/5, steps: 4250/9590, func: 0.043293, pos: 0.038818, neg: 0.0156, 44%: 7h 12m 37s
05/12/2026 11:30:28 - INFO - root - epochs: 3/5, steps: 4300/9590, func: 0.043924, pos: 0.026801, neg: 0.013046, 44%: 7h 8m 32s
05/12/2026 11:34:30 - INFO - root - epochs: 3/5, steps: 4350/9590, func: 0.043028, pos: 0.036002, neg: 0.007841, 45%: 7h 4m 27s
05/12/2026 11:38:26 - INFO - root - epochs: 3/5, steps: 4400/9590, func: 0.04386, pos: 0.032908, neg: 0.013913, 45%: 7h 0m 15s
05/12/2026 11:42:29 - INFO - root - epochs: 3/5, steps: 4450/9590, func: 0.044371, pos: 0.031435, neg: 0.011, 46%: 6h 56m 13s
05/12/2026 11:46:32 - INFO - root - epochs: 3/5, steps: 4500/9590, func: 0.043781, pos: 0.026892, neg: 0.004409, 46%: 6h 52m 11s
05/12/2026 11:50:35 - INFO - root - epochs: 3/5, steps: 4550/9590, func: 0.044126, pos: 0.03361, neg: 0.01439, 47%: 6h 48m 9s
05/12/2026 11:54:34 - INFO - root - epochs: 3/5, steps: 4600/9590, func: 0.043945, pos: 0.042274, neg: 0.011236, 47%: 6h 44m 0s
05/12/2026 11:58:32 - INFO - root - epochs: 3/5, steps: 4650/9590, func: 0.043599, pos: 0.020995, neg: 0.012674, 48%: 6h 39m 53s
05/12/2026 12:02:32 - INFO - root - epochs: 3/5, steps: 4700/9590, func: 0.043737, pos: 0.032744, neg: 0.007529, 48%: 6h 35m 47s
05/12/2026 12:06:32 - INFO - root - epochs: 3/5, steps: 4750/9590, func: 0.04433, pos: 0.018101, neg: 0.00986, 49%: 6h 31m 42s
05/12/2026 12:10:37 - INFO - root - epochs: 3/5, steps: 4800/9590, func: 0.044883, pos: 0.031611, neg: 0.007338, 50%: 6h 27m 42s
05/12/2026 12:14:40 - INFO - root - epochs: 3/5, steps: 4850/9590, func: 0.043752, pos: 0.036319, neg: 0.010377, 50%: 6h 23m 39s
05/12/2026 12:18:40 - INFO - root - epochs: 3/5, steps: 4900/9590, func: 0.043916, pos: 0.026421, neg: 0.008696, 51%: 6h 19m 34s
05/12/2026 12:22:44 - INFO - root - epochs: 3/5, steps: 4950/9590, func: 0.043717, pos: 0.036711, neg: 0.014496, 51%: 6h 15m 32s
05/12/2026 12:26:45 - INFO - root - epochs: 3/5, steps: 5000/9590, func: 0.044367, pos: 0.041459, neg: 0.005121, 52%: 6h 11m 27s
05/12/2026 12:30:49 - INFO - root - epochs: 3/5, steps: 5050/9590, func: 0.044552, pos: 0.030175, neg: 0.00822, 52%: 6h 7m 26s
05/12/2026 12:34:59 - INFO - root - epochs: 3/5, steps: 5100/9590, func: 0.043727, pos: 0.031545, neg: 0.017947, 53%: 6h 3m 29s
05/12/2026 12:39:08 - INFO - root - epochs: 3/5, steps: 5150/9590, func: 0.044665, pos: 0.020976, neg: 0.013421, 53%: 5h 59m 33s
05/12/2026 12:43:13 - INFO - root - epochs: 3/5, steps: 5200/9590, func: 0.04315, pos: 0.035845, neg: 0.009733, 54%: 5h 55m 30s
05/12/2026 12:47:15 - INFO - root - epochs: 3/5, steps: 5250/9590, func: 0.04466, pos: 0.037906, neg: 0.009849, 54%: 5h 51m 27s
05/12/2026 12:51:14 - INFO - root - epochs: 3/5, steps: 5300/9590, func: 0.044784, pos: 0.034, neg: 0.015596, 55%: 5h 47m 21s
05/12/2026 12:55:14 - INFO - root - epochs: 3/5, steps: 5350/9590, func: 0.043455, pos: 0.028191, neg: 0.010957, 55%: 5h 43m 16s
05/12/2026 12:59:16 - INFO - root - epochs: 3/5, steps: 5400/9590, func: 0.042788, pos: 0.030653, neg: 0.011688, 56%: 5h 39m 12s
05/12/2026 13:03:23 - INFO - root - epochs: 3/5, steps: 5450/9590, func: 0.044157, pos: 0.025905, neg: 0.009461, 56%: 5h 35m 13s
05/12/2026 13:07:32 - INFO - root - epochs: 3/5, steps: 5500/9590, func: 0.045079, pos: 0.027724, neg: 0.009764, 57%: 5h 31m 15s
05/12/2026 13:11:38 - INFO - root - epochs: 3/5, steps: 5550/9590, func: 0.043324, pos: 0.032279, neg: 0.014533, 57%: 5h 27m 13s
05/12/2026 13:15:42 - INFO - root - epochs: 3/5, steps: 5600/9590, func: 0.043653, pos: 0.035368, neg: 0.005618, 58%: 5h 23m 11s
05/12/2026 13:19:45 - INFO - root - epochs: 3/5, steps: 5650/9590, func: 0.043204, pos: 0.024611, neg: 0.017919, 58%: 5h 19m 9s
05/12/2026 13:23:46 - INFO - root - epochs: 3/5, steps: 5700/9590, func: 0.04316, pos: 0.018795, neg: 0.010624, 59%: 5h 15m 4s
05/12/2026 13:27:48 - INFO - root - epochs: 3/5, steps: 5750/9590, func: 0.044979, pos: 0.025781, neg: 0.020653, 59%: 5h 11m 1s
05/12/2026 13:31:50 - INFO - root - epochs: 4/5, steps: 5800/9590, func: 0.037187, pos: 0.028788, neg: 0.006066, 60%: 5h 6m 58s
05/12/2026 13:35:52 - INFO - root - epochs: 4/5, steps: 5850/9590, func: 0.037183, pos: 0.01936, neg: 0.004128, 60%: 5h 2m 53s
05/12/2026 13:39:56 - INFO - root - epochs: 4/5, steps: 5900/9590, func: 0.036845, pos: 0.010634, neg: 0.006401, 61%: 4h 58m 51s
05/12/2026 13:43:55 - INFO - root - epochs: 4/5, steps: 5950/9590, func: 0.035898, pos: 0.014306, neg: 0.001101, 62%: 4h 54m 46s
05/12/2026 13:48:01 - INFO - root - epochs: 4/5, steps: 6000/9590, func: 0.036924, pos: 0.021029, neg: 0.003658, 62%: 4h 50m 45s
05/12/2026 13:52:07 - INFO - root - epochs: 4/5, steps: 6050/9590, func: 0.037266, pos: 0.026165, neg: 0.007137, 63%: 4h 46m 44s
05/12/2026 13:56:05 - INFO - root - epochs: 4/5, steps: 6100/9590, func: 0.03719, pos: 0.013228, neg: 0.004765, 63%: 4h 42m 38s
05/12/2026 14:00:08 - INFO - root - epochs: 4/5, steps: 6150/9590, func: 0.037471, pos: 0.011627, neg: 0.003193, 64%: 4h 38m 36s
05/12/2026 14:04:14 - INFO - root - epochs: 4/5, steps: 6200/9590, func: 0.037742, pos: 0.008895, neg: 0.007303, 64%: 4h 34m 34s
05/12/2026 14:08:17 - INFO - root - epochs: 4/5, steps: 6250/9590, func: 0.037754, pos: 0.011739, neg: 0.004153, 65%: 4h 30m 32s
05/12/2026 14:12:13 - INFO - root - epochs: 4/5, steps: 6300/9590, func: 0.036989, pos: 0.01173, neg: 0.008266, 65%: 4h 26m 25s
05/12/2026 14:16:20 - INFO - root - epochs: 4/5, steps: 6350/9590, func: 0.037897, pos: 0.01403, neg: 0.005232, 66%: 4h 22m 24s
05/12/2026 14:20:26 - INFO - root - epochs: 4/5, steps: 6400/9590, func: 0.037609, pos: 0.016425, neg: 0.002059, 66%: 4h 18m 23s
05/12/2026 14:24:32 - INFO - root - epochs: 4/5, steps: 6450/9590, func: 0.038468, pos: 0.019632, neg: 0.013029, 67%: 4h 14m 21s
05/12/2026 14:28:32 - INFO - root - epochs: 4/5, steps: 6500/9590, func: 0.038036, pos: 0.011861, neg: 0.003423, 67%: 4h 10m 17s
05/12/2026 14:32:33 - INFO - root - epochs: 4/5, steps: 6550/9590, func: 0.037247, pos: 0.027833, neg: 0.005179, 68%: 4h 6m 13s
05/12/2026 14:36:40 - INFO - root - epochs: 4/5, steps: 6600/9590, func: 0.037224, pos: 0.015212, neg: 0.006781, 68%: 4h 2m 12s
05/12/2026 14:40:41 - INFO - root - epochs: 4/5, steps: 6650/9590, func: 0.038324, pos: 0.015693, neg: 0.005974, 69%: 3h 58m 8s
05/12/2026 14:44:45 - INFO - root - epochs: 4/5, steps: 6700/9590, func: 0.039012, pos: 0.014201, neg: 0.00685, 69%: 3h 54m 6s
05/12/2026 14:48:52 - INFO - root - epochs: 4/5, steps: 6750/9590, func: 0.037505, pos: 0.010282, neg: 0.002384, 70%: 3h 50m 5s
05/12/2026 14:52:58 - INFO - root - epochs: 4/5, steps: 6800/9590, func: 0.037557, pos: 0.007854, neg: 0.003328, 70%: 3h 46m 3s
05/12/2026 14:56:55 - INFO - root - epochs: 4/5, steps: 6850/9590, func: 0.037521, pos: 0.012866, neg: 0.003266, 71%: 3h 41m 58s
05/12/2026 15:00:56 - INFO - root - epochs: 4/5, steps: 6900/9590, func: 0.037562, pos: 0.004959, neg: 0.003749, 71%: 3h 37m 54s
05/12/2026 15:04:57 - INFO - root - epochs: 4/5, steps: 6950/9590, func: 0.038498, pos: 0.011588, neg: 0.004565, 72%: 3h 33m 50s
05/12/2026 15:09:04 - INFO - root - epochs: 4/5, steps: 7000/9590, func: 0.037594, pos: 0.016966, neg: 0.006965, 72%: 3h 29m 49s
05/12/2026 15:13:12 - INFO - root - epochs: 4/5, steps: 7050/9590, func: 0.039776, pos: 0.013159, neg: 0.006337, 73%: 3h 25m 48s
05/12/2026 15:17:14 - INFO - root - epochs: 4/5, steps: 7100/9590, func: 0.037827, pos: 0.011383, neg: 0.004707, 74%: 3h 21m 44s
05/12/2026 15:21:16 - INFO - root - epochs: 4/5, steps: 7150/9590, func: 0.038144, pos: 0.024477, neg: 0.008781, 74%: 3h 17m 41s
05/12/2026 15:25:23 - INFO - root - epochs: 4/5, steps: 7200/9590, func: 0.039326, pos: 0.026087, neg: 0.008004, 75%: 3h 13m 39s
05/12/2026 15:29:22 - INFO - root - epochs: 4/5, steps: 7250/9590, func: 0.038138, pos: 0.012713, neg: 0.008708, 75%: 3h 9m 35s
05/12/2026 15:33:29 - INFO - root - epochs: 4/5, steps: 7300/9590, func: 0.037897, pos: 0.011754, neg: 0.003533, 76%: 3h 5m 34s
05/12/2026 15:37:28 - INFO - root - epochs: 4/5, steps: 7350/9590, func: 0.037133, pos: 0.011942, neg: 0.01366, 76%: 3h 1m 29s
05/12/2026 15:41:27 - INFO - root - epochs: 4/5, steps: 7400/9590, func: 0.03792, pos: 0.004823, neg: 0.002933, 77%: 2h 57m 25s
05/12/2026 15:45:36 - INFO - root - epochs: 4/5, steps: 7450/9590, func: 0.039068, pos: 0.013343, neg: 0.002967, 77%: 2h 53m 24s
05/12/2026 15:49:33 - INFO - root - epochs: 4/5, steps: 7500/9590, func: 0.03732, pos: 0.014963, neg: 0.00197, 78%: 2h 49m 19s
05/12/2026 15:53:39 - INFO - root - epochs: 4/5, steps: 7550/9590, func: 0.038959, pos: 0.005567, neg: 0.008477, 78%: 2h 45m 17s
05/12/2026 15:57:39 - INFO - root - epochs: 4/5, steps: 7600/9590, func: 0.036912, pos: 0.013687, neg: 0.004179, 79%: 2h 41m 14s
05/12/2026 16:01:41 - INFO - root - epochs: 4/5, steps: 7650/9590, func: 0.039019, pos: 0.014941, neg: 0.004959, 79%: 2h 37m 10s
05/12/2026 16:05:49 - INFO - root - epochs: 5/5, steps: 7700/9590, func: 0.033764, pos: 0.005057, neg: 0.003659, 80%: 2h 33m 9s
05/12/2026 16:09:54 - INFO - root - epochs: 5/5, steps: 7750/9590, func: 0.029872, pos: 0.003296, neg: 0.00179, 80%: 2h 29m 6s
05/12/2026 16:13:57 - INFO - root - epochs: 5/5, steps: 7800/9590, func: 0.030226, pos: 0.004068, neg: 0.000308, 81%: 2h 25m 3s
05/12/2026 16:17:56 - INFO - root - epochs: 5/5, steps: 7850/9590, func: 0.028933, pos: 0.014822, neg: 0.004078, 81%: 2h 20m 59s
05/12/2026 16:21:59 - INFO - root - epochs: 5/5, steps: 7900/9590, func: 0.030283, pos: 0.003466, neg: 0.003448, 82%: 2h 16m 56s
05/12/2026 16:26:04 - INFO - root - epochs: 5/5, steps: 7950/9590, func: 0.029845, pos: 0.007248, neg: 0.001106, 82%: 2h 12m 54s
05/12/2026 16:30:08 - INFO - root - epochs: 5/5, steps: 8000/9590, func: 0.031504, pos: 0.005889, neg: 0.001085, 83%: 2h 8m 51s
05/12/2026 16:34:14 - INFO - root - epochs: 5/5, steps: 8050/9590, func: 0.031164, pos: 0.003975, neg: 0.00115, 83%: 2h 4m 49s
05/12/2026 16:38:12 - INFO - root - epochs: 5/5, steps: 8100/9590, func: 0.030783, pos: 0.003097, neg: 0.001654, 84%: 2h 0m 45s
05/12/2026 16:42:11 - INFO - root - epochs: 5/5, steps: 8150/9590, func: 0.030095, pos: 0.003739, neg: 0.001491, 84%: 1h 56m 41s
05/12/2026 16:46:10 - INFO - root - epochs: 5/5, steps: 8200/9590, func: 0.029816, pos: 0.007938, neg: 0.000674, 85%: 1h 52m 37s
05/12/2026 16:50:14 - INFO - root - epochs: 5/5, steps: 8250/9590, func: 0.031022, pos: 0.002691, neg: 0.002459, 86%: 1h 48m 35s
05/12/2026 16:54:18 - INFO - root - epochs: 5/5, steps: 8300/9590, func: 0.030573, pos: 0.006589, neg: 0.00105, 86%: 1h 44m 32s
05/12/2026 16:58:19 - INFO - root - epochs: 5/5, steps: 8350/9590, func: 0.030611, pos: 0.00893, neg: 0.001367, 87%: 1h 40m 29s
05/12/2026 17:02:25 - INFO - root - epochs: 5/5, steps: 8400/9590, func: 0.03111, pos: 0.006094, neg: 0.003428, 87%: 1h 36m 26s
05/12/2026 17:06:28 - INFO - root - epochs: 5/5, steps: 8450/9590, func: 0.031174, pos: 0.006896, neg: 0.001139, 88%: 1h 32m 23s
05/12/2026 17:10:33 - INFO - root - epochs: 5/5, steps: 8500/9590, func: 0.03143, pos: 0.010165, neg: 0.001063, 88%: 1h 28m 21s
05/12/2026 17:14:33 - INFO - root - epochs: 5/5, steps: 8550/9590, func: 0.02977, pos: 0.001661, neg: 0.000399, 89%: 1h 24m 17s
05/12/2026 17:18:38 - INFO - root - epochs: 5/5, steps: 8600/9590, func: 0.030976, pos: 0.002311, neg: 0.001703, 89%: 1h 20m 14s
05/12/2026 17:22:42 - INFO - root - epochs: 5/5, steps: 8650/9590, func: 0.029891, pos: 0.003819, neg: 0.002301, 90%: 1h 16m 12s
05/12/2026 17:26:45 - INFO - root - epochs: 5/5, steps: 8700/9590, func: 0.030551, pos: 0.001962, neg: 0.000672, 90%: 1h 12m 9s
05/12/2026 17:30:50 - INFO - root - epochs: 5/5, steps: 8750/9590, func: 0.03133, pos: 0.004639, neg: 0.015719, 91%: 1h 8m 6s
05/12/2026 17:34:52 - INFO - root - epochs: 5/5, steps: 8800/9590, func: 0.030877, pos: 0.003829, neg: 0.002129, 91%: 1h 4m 3s
05/12/2026 17:38:52 - INFO - root - epochs: 5/5, steps: 8850/9590, func: 0.029963, pos: 0.002877, neg: 0.000992, 92%: 1h 0m 0s
05/12/2026 17:42:59 - INFO - root - epochs: 5/5, steps: 8900/9590, func: 0.030816, pos: 0.003824, neg: 0.000756, 92%: 0h 55m 57s
05/12/2026 17:47:04 - INFO - root - epochs: 5/5, steps: 8950/9590, func: 0.031247, pos: 0.001944, neg: 0.005025, 93%: 0h 51m 54s
05/12/2026 17:51:06 - INFO - root - epochs: 5/5, steps: 9000/9590, func: 0.030236, pos: 0.004118, neg: 0.00032, 93%: 0h 47m 51s
05/12/2026 17:55:06 - INFO - root - epochs: 5/5, steps: 9050/9590, func: 0.030045, pos: 0.003925, neg: 0.006625, 94%: 0h 43m 48s
05/12/2026 17:59:11 - INFO - root - epochs: 5/5, steps: 9100/9590, func: 0.030309, pos: 0.007086, neg: 0.001351, 94%: 0h 39m 45s
05/12/2026 18:03:12 - INFO - root - epochs: 5/5, steps: 9150/9590, func: 0.030451, pos: 0.001467, neg: 0.004338, 95%: 0h 35m 42s
05/12/2026 18:07:17 - INFO - root - epochs: 5/5, steps: 9200/9590, func: 0.030429, pos: 0.007513, neg: 0.003631, 95%: 0h 31m 39s
05/12/2026 18:11:21 - INFO - root - epochs: 5/5, steps: 9250/9590, func: 0.030666, pos: 0.003335, neg: 0.003133, 96%: 0h 27m 37s
05/12/2026 18:15:25 - INFO - root - epochs: 5/5, steps: 9300/9590, func: 0.030095, pos: 0.006037, neg: 0.001205, 96%: 0h 23m 34s
05/12/2026 18:19:29 - INFO - root - epochs: 5/5, steps: 9350/9590, func: 0.030603, pos: 0.005101, neg: 0.001488, 97%: 0h 19m 31s
05/12/2026 18:23:32 - INFO - root - epochs: 5/5, steps: 9400/9590, func: 0.031185, pos: 0.00453, neg: 0.000905, 98%: 0h 15m 28s
05/12/2026 18:27:35 - INFO - root - epochs: 5/5, steps: 9450/9590, func: 0.031027, pos: 0.005386, neg: 0.00126, 98%: 0h 11m 25s
05/12/2026 18:31:37 - INFO - root - epochs: 5/5, steps: 9500/9590, func: 0.030854, pos: 0.007732, neg: 0.00126, 99%: 0h 7m 22s
05/12/2026 18:35:37 - INFO - root - epochs: 5/5, steps: 9550/9590, func: 0.031091, pos: 0.004284, neg: 0.000373, 99%: 0h 3m 19s
05/12/2026 18:47:11 - INFO - root - final eval loss: func: 0.051988, pos: 0.0564, neg: 0.048031
05/12/2026 18:47:11 - INFO - root - Saving model checkpoint to ../trained/codellama-7b-lora-safecoder/checkpoint-last
|